uds-lsv / afro-maftLinks
☆17Updated 2 years ago
Alternatives and similar repositories for afro-maft
Users that are interested in afro-maft are comparing it to the libraries listed below
Sorting:
- MAFAND-MT☆59Updated last year
- Crosslingual Question Answering for African Languages☆31Updated last year
- ☆115Updated last month
- COMET for African languages☆10Updated 9 months ago
- NTREX -- News Test References for MT Evaluation☆86Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages.☆112Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆70Updated last year
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- ☆115Updated 11 months ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub.☆11Updated last year
- Yet Another Neural Machine Translation Toolkit☆180Updated 8 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te…☆284Updated last month
- ☆220Updated 3 months ago
- ☆52Updated 2 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages : https://afrisenti-semeval.github.io/☆49Updated last year
- Machine Translation for Africa☆298Updated 3 years ago
- Some notebooks for NLP☆207Updated 2 years ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish.☆59Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Updated last year
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM"☆61Updated last year
- Code and models used in "MUSS Multilingual Unsupervised Sentence Simplification by Mining Paraphrases".☆99Updated 2 years ago
- Pipeline for pulling and processing online language model pretraining data from the web☆178Updated 2 years ago
- Agile reading group that works☆13Updated 3 years ago
- Efficient Attention for Long Sequence Processing☆97Updated last year
- SIB-200: A Simple, Inclusive, and Big Evaluation Dataset for Topic Classification in 200+ Languages and Dialects☆22Updated 9 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆171Updated 5 months ago
- The FLORES+ Machine Translation Benchmark☆109Updated last year
- MasakhaNEWS: News Topic Classification for African Languages☆24Updated last year
- Experiments for XLM-V Transformers Integeration☆13Updated 2 years ago
- This dataset contains synthetic training data for grammatical error correction. The corpus is generated by corrupting clean sentences fro…☆161Updated last year