BatsResearch / LexC-Gen
Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.
☆18 Updated last year
Alternatives and similar repositories for LexC-Gen
Users who are interested in LexC-Gen are comparing it to the repositories listed below.
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures. ☆94 Updated 11 months ago
- ☆231 Updated 5 months ago
- Resources for cultural NLP research ☆113 Updated 3 months ago
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way ☆16 Updated 2 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆71 Updated last year
- The FLORES+ Machine Translation Benchmark ☆109 Updated last year
- Utility for behavioral and representational analyses of Language Models ☆172 Updated 2 weeks ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks. ☆123 Updated 2 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023 ☆107 Updated last year
- Multilingual Large Language Models Evaluation Benchmark ☆133 Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆96 Updated 2 years ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models" ☆35 Updated 7 months ago
- The geometry of multilingual language model representations (EMNLP 2022). ☆22 Updated 3 years ago
- Crosslingual Question Answering for African Languages ☆30 Updated last year
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, … ☆97 Updated last year
- ☆17 Updated 2 years ago
- Interpretability for sequence generation models 🐛 🔍 ☆451 Updated this week
- Find informative examples to efficiently (human)-evaluate NLG models. ☆17 Updated last month
- A curated list of research papers and resources on Cultural LLM. ☆52 Updated last year
- SeeGULL is a broad-coverage stereotype dataset in English containing stereotypes about identity groups spanning 178 countries across 8 di… ☆37 Updated 2 years ago
- MAFAND-MT ☆60 Updated last year
- ☆119 Updated last year
- German Alpaca Dataset (Cleaned + Translated) ☆26 Updated 2 years ago
- ☆65 Updated 2 years ago
- ☆55 Updated 3 years ago
- Repository for research in the field of Responsible NLP at Meta. ☆204 Updated 7 months ago
- ☆102 Updated last year
- A reading list of up-to-date papers on NLP for Social Good. ☆304 Updated 2 years ago
- A Multilingual Replicable Instruction-Following Model ☆95 Updated 2 years ago
- ☆24 Updated 4 years ago