FerreroJeremy / Cross-Language-Dataset
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
☆60 Updated 8 years ago
Alternatives and similar repositories for Cross-Language-Dataset
Users who are interested in Cross-Language-Dataset are comparing it to the repositories listed below
- Large scale sentential paraphrases collection and annotation ☆46 Updated 2 years ago
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation ☆63 Updated 6 years ago
- ☆33 Updated 3 years ago
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies. ☆28 Updated 6 years ago
- Sume is an implementation of the concept-based ILP model for summarization. ☆37 Updated 6 years ago
- A curated question answering research dataset of factoid questions ☆49 Updated 5 years ago
- Cross-Lingual Alignment of Contextual Word Embeddings ☆99 Updated 5 years ago
- Code for paper "End-to-End Non-Factoid Question Answering with an Interactive Visualization of Neural Attention Weights" ☆65 Updated 7 years ago
- ☆24 Updated 8 years ago
- semantic summarization using abstract meaning representation (AMR) ☆74 Updated 10 years ago
- ☆44 Updated 7 years ago
- Named Entity Disambiguation for Noisy Text ☆66 Updated 7 years ago
- Code for EMNLP 2018 paper "Auto-Encoding Dictionary Definitions into Consistent Word Embeddings" ☆36 Updated 6 years ago
- A BiRNN framework implemented in Python and TensorFlow to extract parallel sentences from aligned comparable corpora. ☆33 Updated 6 years ago
- takahe is a multi-sentence compression module ☆53 Updated 3 years ago
- Multilingual hierarchical attention networks toolkit ☆77 Updated 5 years ago
- Unsupervised Multilingual Word Embeddings (EMNLP 2018) ☆81 Updated 3 years ago
- Semantic Textual Similarity in Python ☆80 Updated 8 years ago
- A Dependency Parser for Tweets ☆78 Updated 5 years ago
- data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015 ☆43 Updated 4 years ago
- Datasets for Question Answering by Search and Reading ☆69 Updated 7 years ago
- ☆38 Updated 8 years ago
- ☆43 Updated 9 years ago
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings) ☆122 Updated last year
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M… ☆73 Updated 6 years ago
- Text Simplification Model based on Encoder-Decoder (includes Transformer and Seq2Seq) model. ☆68 Updated 2 years ago
- Easy-first dependency parser based on Hierarchical Tree LSTMs ☆33 Updated 8 years ago
- Format converter from one type of QA task dataset to another ☆39 Updated 6 years ago
- A transition-based parser for Universal Dependencies with BiLSTM word and character representations. ☆82 Updated 2 years ago
- List of NLP (Natural Language Processing) Corpora. ☆63 Updated 6 years ago