zzeng13 / DISCLinks
Automatic Idiomatic Expression Detection
☆13Updated 4 years ago
Alternatives and similar repositories for DISC
Users that are interested in DISC are comparing it to the libraries listed below
Sorting:
- ☆81Updated last week
- Efficient Low-Memory Aligner☆146Updated last year
- OpusFilter - Parallel corpus processing toolkit☆115Updated this week
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆54Updated 2 years ago
- An implementation of SpaCy(3.0)'s Matcher specifically designed for identifying English idioms.☆47Updated 10 months ago
- [EMNLP 2021] LM-Critic: Language Models for Unsupervised Grammatical Error Correction☆120Updated 4 years ago
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆386Updated 2 years ago
- A corpus of short answers written by learners of English and graded with CEFR levels☆12Updated 4 years ago
- Improved Sentence Alignment in Linear Time and Space☆188Updated 2 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆162Updated last year
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆34Updated 3 years ago
- Sentence aligner☆124Updated 4 years ago
- cLang-8 is a dataset for grammatical error correction.☆112Updated 3 years ago
- Repository to collect and categorize Grammatical Error Correction papers.☆123Updated last week
- A tool that locates, downloads, and extracts machine translation corpora☆162Updated 4 months ago
- a tool for calcualting character n-gram F score☆77Updated 3 years ago
- A single model that parses Universal Dependencies across 75 languages. Given a sentence, jointly predicts part-of-speech tags, morphology…☆225Updated 3 years ago
- Multilingual sentence alignment using sentence embeddings☆139Updated last year
- Python framework for processing Universal Dependencies data☆59Updated last week
- MAGPIE: A sense-annotated corpus of potentially idiomatic expressions☆31Updated 5 years ago
- Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder.☆255Updated 3 years ago
- ☆67Updated 5 months ago
- ☆50Updated last year
- Repository for DISRPT2023 shared task☆17Updated last year
- A neural word aligner based on multilingual BERT☆370Updated 3 years ago
- XL-AMR is a sequence-to-graph cross-lingual AMR parser that exploits transfer learning (EMNLP2020).☆17Updated last year
- This packages up data for the Open Multilingual Wordnet☆60Updated last week
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆160Updated last year
- Transformer based translation quality estimation☆114Updated 2 years ago
- STREUSLE: a corpus with comprehensive lexical semantic annotation (multiword expressions, supersenses)☆69Updated 2 months ago