FerreroJeremy / Cross-Language-Dataset
A multilingual, multi-style and multi-granularity dataset for cross-language textual similarity detection
☆60Updated 7 years ago
Alternatives and similar repositories for Cross-Language-Dataset:
Users that are interested in Cross-Language-Dataset are comparing it to the libraries listed below
- Twpipe is a pipeline toolkit that parses raw tweets into universal dependencies.☆28Updated 5 years ago
- data and scripts for the shared task "Task 1: Paraphrase and Semantic Similarity in Twitter (PIT)" at SemEval 2015☆44Updated 4 years ago
- ☆24Updated 8 years ago
- ☆45Updated 7 years ago
- Large scale sentential paraphrases collection and annotation☆47Updated 2 years ago
- Bidirectional Long-Short Term Memory tagger (bi-LSTM) (in DyNet) -- hierarchical (with word and character embeddings)☆122Updated last year
- Uncovering divergent linguistic information in word embeddings with lessons for intrinsic and extrinsic evaluation☆63Updated 6 years ago
- An updated version of the Parser-v1 repo, used for Stanford's submission in the CoNLL17 shared task.☆47Updated 6 years ago
- Named Entity Disambiguation for Noisy Text☆67Updated 7 years ago
- Cross-Lingual Alignment of Contextual Word Embeddings☆99Updated 4 years ago
- State-of-the-art Supervised Sentence Simplification System from ACL 2014☆47Updated 6 years ago
- Text Simplification System and Dataset☆124Updated last year
- Tools for extracting parallel corpora from article titles across languages in Wikipedia☆72Updated 9 years ago
- Mining Discourse Markers for Unsupervised Sentence Representation Learning☆60Updated last year
- ROUGE summarization evaluation metric, enhanced with use of Word Embeddings☆22Updated 6 years ago
- Code for paper "End-to-End Non-Factoid Question Answering with an Interactive Visualization of Neural Attention Weights"☆66Updated 6 years ago
- Decoding platform for machine translation research☆54Updated 5 years ago
- pair2vec: Compositional Word-Pair Embeddings for Cross-Sentence Inference☆61Updated 2 years ago
- The Attract-Repel algorithm presented in (Mrkšić et al., TACL 2017), with accompanying resources.☆64Updated 7 years ago
- Coreference Resolution With Entity Equalization☆40Updated 2 years ago
- Python interface for converting Penn Treebank trees to Stanford Dependencies and Universal Depenencies☆71Updated 5 years ago
- ☆48Updated 7 years ago
- Parser for Abstract Meaning Representation☆46Updated 4 years ago
- Scripts for WASSA-2017 Shared Task on Emotion Intensity☆14Updated 7 years ago
- Preprocessing scripts to read definitions and other information from dictionaries☆22Updated 7 years ago
- Text generation with entities as context☆31Updated 6 years ago
- Sume is an implementation of the concept-based ILP model for summarization.☆38Updated 6 years ago
- Baseline models, training scripts, and instructions on how to reproduce our results for our state-of-art grammar correction system from M…☆69Updated 5 years ago