uds-lsv / afro-maftLinks
☆17Updated 2 years ago
Alternatives and similar repositories for afro-maft
Users that are interested in afro-maft are comparing it to the libraries listed below
Sorting:
- MAFAND-MT☆57Updated last year
- Crosslingual Question Answering for African Languages☆31Updated 9 months ago
- COMET for African languages☆10Updated 5 months ago
- NTREX -- News Test References for MT Evaluation☆84Updated last year
- ☆109Updated last year
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- Yet Another Neural Machine Translation Toolkit☆179Updated 4 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆280Updated 5 months ago
- ☆102Updated 7 months ago
- Some notebooks for NLP☆205Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆106Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆70Updated last year
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2☆127Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆82Updated 10 months ago
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages☆74Updated 3 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆59Updated 8 months ago
- Efficient Attention for Long Sequence Processing☆95Updated last year
- Machine Translation for Africa☆289Updated 3 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆160Updated 9 months ago
- Generate large textual corpora for almost any language by crawling the web☆12Updated last year
- This repository contains datasets and code for the paper "HINT3: Raising the bar for Intent Detection in the Wild" accepted at EMNLP-2020…☆33Updated 4 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆103Updated last year
- Open information and community for machine translation☆79Updated 2 weeks ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Updated last year
- ☆51Updated 2 years ago
- Reduce the size of pretrained Hugging Face models via vocabulary trimming.☆45Updated 2 years ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆142Updated last month
- POS for African languages☆17Updated 3 weeks ago
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆98Updated 2 years ago
- List of all the resources I developed in collaboration with LSV and Masakhane during my doctoral studies and beyond☆12Updated 2 years ago