KurdishBLARK / InterdialectCorpus
A parallel corpus of Sorani, Kurmanji and English
☆13Updated 4 years ago
Alternatives and similar repositories for InterdialectCorpus
Users that are interested in InterdialectCorpus are comparing it to the libraries listed below
Sorting:
- Rule-based Kurdish Transliterator☆9Updated last year
- A morphological analyzer and spell checker for Kurdish in Hunspell☆30Updated 3 years ago
- The Kurdish Language Processing Toolkit☆96Updated 8 months ago
- OpusFilter - Parallel corpus processing toolkit☆104Updated last month
- ☆15Updated 3 years ago
- Caucasus languages focused multilingual and monolingual corpuses for Natural Language Processing(NLP)☆35Updated 5 months ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆33Updated 2 years ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆74Updated last month
- These are lists for a variety of languages containing words that are distinctive to each language.☆38Updated 3 years ago
- A tool for converting TMX files into bilingual corpora☆18Updated 5 years ago
- OpusCleaner is a web interface that helps you select, clean and schedule your data for training machine translation models.☆52Updated 2 weeks ago
- Linguistically analyzed Classical Tibetan texts☆26Updated 3 years ago
- Code and Data for Paper "Controlling Styles in Neural Machine Translation with Activation Prompt" (ACL 2023 Findings)☆16Updated 2 years ago
- A tiny BERT for low-resource monolingual models☆31Updated 7 months ago
- Python Finite-State Toolkit☆54Updated last week
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆15Updated 4 years ago
- MorphyNet: a Large Multilingual Database of Derivational and Inflectional Morphology (+morpheme segmentation)☆45Updated 2 years ago
- Tools for filtering and cleaning parallel and monolingual corpora for machine translation and other natural language processing tasks.☆41Updated last year
- Efficient Low-Memory Aligner☆143Updated 4 months ago
- Zero-shot Transfer Learning from English to Arabic☆29Updated 2 years ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆30Updated 3 years ago
- Bicleaner fork that uses neural networks☆40Updated this week
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated 11 months ago
- Transformer based translation quality estimation☆111Updated last year
- JavaScript tools for normalization and transliteration of Kurdish texts☆19Updated 2 years ago
- Context-Sensitive Neural Spelling Checker☆20Updated 7 months ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆80Updated 5 months ago
- A tool that locates, downloads, and extracts machine translation corpora☆154Updated 3 weeks ago
- Repository to collect and categorize Grammatical Error Correction papers.☆118Updated last month
- Improved Sentence Alignment in Linear Time and Space☆171Updated 2 years ago