A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
☆61May 29, 2017Updated 9 years ago
Alternatives and similar repositories for Cross-Language-Dataset
Users that are interested in Cross-Language-Dataset are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Multilingual and Multilevel Representation Learning Toolkit for NLP☆117Feb 14, 2018Updated 8 years ago
- Open-source implementation of the BilBOWA (Bilingual Bag-of-Words without Alignments) word embedding model.☆69Jul 28, 2021Updated 4 years ago
- Tool for Evaluating Multilingual WS-353 and SimLex-999☆10Dec 15, 2016Updated 9 years ago
- Train bilingual embeddings as described in our NAACL 2015 workshop paper "Bilingual Word Representations with Monolingual Quality in Mind…☆79Jun 15, 2019Updated 7 years ago
- Rust library for indexing and quickly searching large pretraining corpora☆31Oct 30, 2025Updated 8 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Atari gauntlet for RL agents☆29Mar 18, 2017Updated 9 years ago
- A framework to learn cross-lingual word embedding mappings☆654Apr 22, 2023Updated 3 years ago
- Crosslingual word embeddings described in our EMNLP paper☆16Sep 21, 2016Updated 9 years ago
- Code and data for EMNLP 2018 paper "Cross-lingual Lexical Sememe Prediction"☆19Nov 9, 2018Updated 7 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26May 4, 2017Updated 9 years ago
- Semantic Textual Similarity in Python☆80Jan 30, 2017Updated 9 years ago
- Implementation of a deep recursive net over binary parse trees (code for NIPS2014 paper)☆28Feb 6, 2015Updated 11 years ago
- A curated question answering research dataset of factoid questions☆49Nov 9, 2019Updated 6 years ago
- This project demonstrates the use of generic bi-directional LSTM models for predicting importance of words in a spoken dialgoue for under…☆11Mar 24, 2023Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Using embedding-based loss functions for phonetics/speech recognition.☆17Nov 24, 2014Updated 11 years ago
- [EMNLP 2021] MuVER: Improving First-Stage Entity Retrieval with Multi-View Entity Representations☆32May 23, 2022Updated 4 years ago
- This is an implementation of the Attention Sum Reader model as presented in "Text Comprehension with the Attention Sum Reader Network" av…☆98Sep 9, 2016Updated 9 years ago
- ☆10Sep 6, 2024Updated last year
- Experiments with an rCNN for scene labeling.☆15Mar 20, 2019Updated 7 years ago
- Structured Neural Networks for NLP: From Idea to Code☆59Dec 13, 2016Updated 9 years ago
- Scraps of random machine learning code☆15Oct 19, 2016Updated 9 years ago
- Reimplementation of Munkhdalai et al's Neural Semantic Encoders (https://arxiv.org/pdf/1607.04315v2.pdf)☆59Oct 28, 2016Updated 9 years ago
- Indonesian Resource Grammar (INDRA) - an implemented HPSG grammar for Indonesian☆15Mar 15, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆12Nov 1, 2025Updated 8 months ago
- Evaluation code and data for "Automatic Correction of Human Translations" [NAACL 2022].☆19Dec 9, 2022Updated 3 years ago
- [ACL 2018] Conditional Generators of Words Definitions☆33Jul 18, 2018Updated 7 years ago
- Attempts to create a state of the art language model on clinical and medical text data.☆12Oct 9, 2018Updated 7 years ago
- All My Pytorch projects reside here☆33Dec 10, 2017Updated 8 years ago
- TensorFlow implementation [ICLR 18] "Learning Approximate Inference Networks for Structured Prediction"☆30Jun 10, 2018Updated 8 years ago
- A curated list of resources related to temporal embeddings☆15Dec 14, 2018Updated 7 years ago
- ☆81Mar 8, 2014Updated 12 years ago
- ☆143Dec 31, 2019Updated 6 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Benchmark Datasets for BioNLP Tasks☆17May 7, 2025Updated last year
- This package supports implementation of anchor-based topic modeling and variants of the anchoring algorithm in Python 3.☆15Sep 17, 2018Updated 7 years ago
- Source code for paper "On the Pareto Front of Multilingual Neural Machine Translation" @ NeurIPS 2023☆17Sep 27, 2023Updated 2 years ago
- Unsupervised Cross-lingual Sentiment Analysis (CoNLL 2019)☆10Nov 4, 2019Updated 6 years ago
- Tensorflow Tutorial files and Implementations of various Deep NLP and CV Models.☆47Oct 3, 2016Updated 9 years ago
- TensorFlow code and pre-trained models for A Dynamic Word Representation Model Based on Deep Context. It combines the idea of BERT model…☆15Dec 27, 2018Updated 7 years ago
- Representation Learning of Entities and Documents from Knowledge Base Descriptions☆18Oct 6, 2018Updated 7 years ago