FerreroJeremy / Cross-Language-DatasetLinks
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
☆60Updated 8 years ago
Alternatives and similar repositories for Cross-Language-Dataset
Users that are interested in Cross-Language-Dataset are comparing it to the libraries listed below
Sorting:
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 7 years ago
- Multilingual hierarchical attention networks toolkit☆77Updated 5 years ago
- Semantic Textual Similarity in Python☆80Updated 8 years ago
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings)☆123Updated 2 years ago
- Large scale sentential paraphrases collection and annotation☆46Updated 2 years ago
- CogComp's light-weight Python NLP annotators☆115Updated 6 years ago
- Code for EMNLP 2018 paper "Auto-Encoding Dictionary Definitions into Consistent Word Embeddings"☆36Updated 7 years ago
- A curated question answering research dataset of factoid questions☆49Updated 6 years ago
- Language modeling scripts based on TensorFlow☆58Updated 6 years ago
- Pre-training character n-gram embeddings☆23Updated 2 years ago
- Tools for accessing Maluuba's Travel Dialogue Dataset☆76Updated 6 years ago
- utility class for building/evaluating document representations☆53Updated 5 years ago
- ☆44Updated 7 years ago
- Automatically exported from code.google.com/p/jacana☆37Updated 10 years ago
- ☆52Updated 7 years ago
- ☆40Updated 5 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆73Updated 6 years ago
- Code for reproducing the results from the paper Few Shot Text Classification with a Human in the Loop☆90Updated 7 years ago
- Keras implementation of CoVe☆50Updated 7 years ago
- ☆48Updated 6 years ago
- Dict2vec is a framework to learn word embeddings using lexical dictionaries.☆115Updated 4 years ago
- data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015☆43Updated 5 years ago
- Pytorch implementation of "Get to the point: Get To The Point: Summarization with Pointer-Generator Networks"☆76Updated 8 years ago
- ☆25Updated 2 years ago
- An extension of word2vec to learn phrase embeddings☆76Updated 7 years ago
- ☆47Updated 7 years ago
- Code for paper "End-to-End Non-Factoid Question Answering with an Interactive Visualization of Neural Attention Weights"☆65Updated 7 years ago
- [NAACL 2019] code for "Pragmatically Informative Text Generation" https://arxiv.org/abs/1904.01301☆47Updated 5 years ago
- Stanford Sentiment Treebank loader in Python☆98Updated 5 years ago
- ☆47Updated 8 years ago