uds-lsv / afro-maft
☆17 Updated 2 years ago
Alternatives and similar repositories for afro-maft
Users who are interested in afro-maft are comparing it to the repositories listed below.
- MAFAND-MT ☆57 Updated last year
- Crosslingual Question Answering for African Languages ☆31 Updated 10 months ago
- ☆110 Updated last year
- COMET for African languages ☆10 Updated 6 months ago
- NTREX -- News Test References for MT Evaluation ☆84 Updated last year
- ☆104 Updated 7 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆70 Updated last year
- This repository contains the HiNER dataset released with our paper at LREC 2022 ☆15 Updated 2 years ago
- Some notebooks for NLP ☆207 Updated last year
- Yet Another Neural Machine Translation Toolkit ☆179 Updated 5 months ago
- MasakhaNEWS: News Topic Classification for African Languages ☆24 Updated last year
- AfriBERTa: Exploring the Viability of Pretrained Multilingual Language Models for Low-resourced Languages ☆74 Updated 3 years ago
- Code repository for "Introducing Airavata: Hindi Instruction-tuned LLM" ☆60 Updated 9 months ago
- A library for preparing data for machine translation research (monolingual preprocessing, bitext mining, etc.) built by the FAIR NLLB te… ☆282 Updated 6 months ago
- ☆23 Updated 2 years ago
- AfriSenti-SemEval Shared Task 12: Sentiment Analysis for African languages: https://afrisenti-semeval.github.io/ ☆47 Updated last year
- German Alpaca Dataset (Cleaned + Translated) ☆26 Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023 ☆103 Updated last year
- A repository for publicly/freely available Natural Language Processing (NLP) datasets for African languages. ☆106 Updated last year
- Fine-tuning Open-Source LLMs for Adaptive Machine Translation ☆85 Updated 3 weeks ago
- Machine Translation for Africa ☆292 Updated 3 years ago
- indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2 ☆129 Updated last year
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023 ☆147 Updated 2 months ago
- A simple semi-supervised approach for creating huggingface data script loaders and upload to the hub. ☆11 Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. ☆82 Updated 10 months ago
- Python-based implementation of the Translate-Align-Retrieve method to automatically translate the SQuAD Dataset to Spanish. ☆59 Updated 2 years ago
- Tools for managing datasets for governance and training. ☆85 Updated 2 months ago
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a… ☆41 Updated last year
- This repository contains materials for the SIGIR 2022 tutorial on opinion summarization. ☆34 Updated 3 years ago
- Code and data for the IWSLT 2022 shared task on Formality Control for SLT ☆21 Updated 2 years ago