KurdishBLARK / InterdialectCorpus
A parallel corpus of Sorani, Kurmanji and English
☆11Updated 4 years ago
Alternatives and similar repositories for InterdialectCorpus:
Users that are interested in InterdialectCorpus are comparing it to the libraries listed below
- A morphological analyzer and spell checker for Kurdish in Hunspell☆29Updated 2 years ago
- A curated list of awesome resources and tools for Kurdish language and speech technologies☆57Updated last year
- The Kurdish Language Processing Toolkit☆95Updated 6 months ago
- Repository accompanying "An Open Dataset and Model for Language Identification" (Burchell et al., 2023)☆70Updated 11 months ago
- Creating super-parallel corpora of more than 1500+ unique languages for NLP research☆33Updated 2 years ago
- فەرهەنگیکی سەرچاوەکراوەی ئینگلیزی کوردی بۆ وۆردپرێس☆10Updated 2 years ago
- An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Span…☆77Updated 4 months ago
- OpusFilter - Parallel corpus processing toolkit☆104Updated this week
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆31Updated 3 weeks ago
- These are lists for a variety of languages containing words that are distinctive to each language.☆37Updated 2 years ago
- Efficient teacher-student models and scripts to make them☆50Updated last year
- A repository for resources in Kurdish Language☆18Updated 4 years ago
- Indian Language Tagger and Chunker (Hindi, Telugu, Tamil, Marathi, Punjabi, Kanada, Malayalam, Urdu, Bengali)☆40Updated 2 years ago
- Arabic Dialect Identification on AOC data.☆24Updated 6 years ago
- ☆72Updated last month
- ☆14Updated 2 years ago
- 3000+ machine-readable open source dictionaries distributed by the Applied Computational Linguistics lab at the University of Augsburg, G…☆11Updated last year
- Python Finite-State Toolkit☆53Updated last month
- AsoSoft Speech Corpus can be used for spoken language processing tasks in Central Kurdish such as speech recognition, speaker recognition…☆10Updated 3 years ago
- JavaScript tools for normalization and transliteration of Kurdish texts☆19Updated 2 years ago
- Caucasus languages focused multilingual and monolingual corpuses for Natural Language Processing(NLP)☆35Updated 4 months ago
- Coursera Corpus Mining and Multistage Fine-Tuning for Improving Lectures Translation☆14Updated 7 months ago
- PALI: Language identification for Perso-Arabic Scripts☆9Updated last year
- ☆9Updated last month
- A Language-Independent Unsupervised Morphological Segmentation Framework based on Adaptor Grammars☆17Updated 9 months ago
- Wiktra - Python tool of Wiktionary Transliteration modules for 514 languages and its 102 different scripts (orthographies)☆27Updated 3 years ago
- Helsinki Finite-State Technology (library and application suite)☆128Updated last month
- TUFS Asian Language Parallel Corpus☆50Updated last year
- MAMMOTH: MAssively Multilingual Modular Open Translation @ Helsinki☆22Updated last month
- LOW-RESOURCE NEURAL MACHINE TRANSLATION: A BENCHMARK FOR FIVE AFRICAN LANGUAGES☆15Updated 4 years ago