FerreroJeremy / Cross-Language-Dataset
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
☆61 · Updated 8 years ago
Alternatives and similar repositories for Cross-Language-Dataset
Users who are interested in Cross-Language-Dataset are comparing it to the repositories listed below.
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings) ☆122 · Updated 2 years ago
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation ☆63 · Updated 6 years ago
- Semantic Textual Similarity in Python ☆81 · Updated 8 years ago
- data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015 ☆43 · Updated 4 years ago
- A curated question answering research dataset of factoid questions ☆49 · Updated 5 years ago
- Multilingual hierarchical attention networks toolkit ☆77 · Updated 5 years ago
- CogComp's light-weight Python NLP annotators ☆115 · Updated 6 years ago
- An extension of word2vec to learn phrase embeddings ☆75 · Updated 6 years ago
- Large scale sentential paraphrases collection and annotation ☆46 · Updated 2 years ago
- An attentional NMT model in Dynet ☆26 · Updated 6 years ago
- Universal segmenter based on the Universal Dependency framework, written by Y. Shao, Uppsala University ☆34 · Updated 6 years ago
- The WebSplit Benchmark introducing "Split and Rephrase" task ☆63 · Updated 6 years ago
- ☆25 · Updated 2 years ago
- ☆24 · Updated 9 years ago
- ☆48 · Updated 6 years ago
- A python library to compute rouge score for summarization ☆57 · Updated 2 years ago
- Modularizing Unsupervised Sense Embedding ☆29 · Updated 7 years ago
- The Argument Reasoning Comprehension Task: Source codes & Datasets ☆76 · Updated 3 years ago
- Code for EMNLP 2018 paper "Auto-Encoding Dictionary Definitions into Consistent Word Embeddings" ☆36 · Updated 7 years ago
- ☆66 · Updated 2 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora. ☆33 · Updated 7 years ago
- ☆47 · Updated 8 years ago
- Dict2vec is a framework to learn word embeddings using lexical dictionaries. ☆115 · Updated 4 years ago
- Datasets for Question Answering by Search and Reading ☆70 · Updated 7 years ago
- utility class for building/evaluating document representations ☆53 · Updated 5 years ago
- One million English sentences, each split into two sentences that together preserve the original meaning, extracted from Wikipedia edits. ☆122 · Updated 6 years ago
- Stanford Sentiment Treebank loader in Python ☆98 · Updated 5 years ago
- ☆125 · Updated 8 years ago
- Code for the paper "Extreme Adaptation for Personalized Neural Machine Translation" ☆42 · Updated 3 years ago
- Neural SRL model ☆71 · Updated 3 years ago