uds-lsv / menyo-20k_MTLinks
MENYO-20k Corpus in "The Effect of Domain and Diacritics in Yorùbá-English Neural Machine Translation" in MT Summit 2021
☆11Updated 2 years ago
Alternatives and similar repositories for menyo-20k_MT
Users that are interested in menyo-20k_MT are comparing it to the libraries listed below
Sorting:
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year
- Machine Translation (MT) Preparation Scripts☆32Updated last month
- A neural word aligner based on multilingual BERT☆352Updated 3 years ago
- A curated list of research papers and resources on code-switching☆320Updated 7 months ago
- Facebook Low Resource (FLoRes) MT Benchmark☆745Updated last year
- Neural Machine Translation (NMT) tutorial. Data preprocessing, model training, evaluation, and deployment.☆168Updated last year
- Machine Translation for Africa☆289Updated 3 years ago
- A tool that locates, downloads, and extracts machine translation corpora☆155Updated last month
- Yorùbá language training text for NLP, ASR and TTS tasks☆77Updated 2 years ago
- Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.☆158Updated last year
- Obtain Word Alignments using Pretrained Language Models (e.g., mBERT)☆366Updated last year
- OpusFilter - Parallel corpus processing toolkit☆106Updated 2 weeks ago
- A Neural Framework for MT Evaluation☆627Updated last month
- ☆54Updated 3 years ago
- NTREX -- News Test References for MT Evaluation☆84Updated last year
- Machine translation (MT) benchmark dataset for languages in the Horn of Africa.☆40Updated 2 years ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆280Updated 5 months ago
- ☆109Updated last year
- Improved Sentence Alignment in Linear Time and Space☆175Updated 2 years ago
- A simple library for querying the URIEL typological database.☆91Updated last year
- Yet Another Neural Machine Translation Toolkit☆179Updated 4 months ago
- ☆17Updated 2 years ago
- How to finetune mbart using fairseq☆24Updated 4 years ago
- Tools and Modeling Code for the MASSIVE dataset☆547Updated 2 years ago
- m4Adapter: Multilingual Multi-Domain Adaptation for Machine Translation with a Meta-Adapter (Findings of EMNLP 2022)☆19Updated 2 years ago
- Bilingual term extractor☆54Updated last year
- ☆215Updated 3 weeks ago
- TSAR2022 Shared Task on Lexical Simplification - Datasets and Evaluation scripts☆10Updated 2 years ago
- The central repo for Creole based NLU and NLG work☆18Updated 2 months ago
- Neural CRF Model for Sentence Alignment in Text Simplification☆68Updated 6 months ago