BatsResearch / LexC-GenLinks
Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.
☆18Updated last year
Alternatives and similar repositories for LexC-Gen
Users that are interested in LexC-Gen are comparing it to the libraries listed below
Sorting:
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆92Updated 8 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆70Updated last year
- ☆220Updated 2 months ago
- Multilingual Large Language Models Evaluation Benchmark☆132Updated last year
- A reading list of up-to-date papers on NLP for Social Good.☆305Updated 2 years ago
- German Alpaca Dataset (Cleaned + Translated)☆26Updated 2 years ago
- ☆17Updated 2 years ago
- Resources for cultural NLP research☆104Updated 3 weeks ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Updated last year
- The FLORES+ Machine Translation Benchmark☆108Updated 11 months ago
- A curated list of research papers and resources on Cultural LLM.☆51Updated last year
- A Multilingual Replicable Instruction-Following Model☆95Updated 2 years ago
- SeeGULL is a broad-coverage stereotype dataset in English containing stereotypes about identity groups spanning 178 countries across 8 di…☆36Updated 2 years ago
- Crosslingual Reasoning through Test-Time Scaling☆19Updated 5 months ago
- The geometry of multilingual language model representations (EMNLP 2022).☆22Updated 2 years ago
- ☆100Updated last year
- Repository for research in the field of Responsible NLP at Meta.☆202Updated 5 months ago
- Find informative examples to efficiently (human)-evaluate NLG models.☆16Updated last week
- The Benchmark of Linguistic Minimal Pairs☆154Updated 2 years ago
- ☆169Updated last year
- Interpretability for sequence generation models 🐛 🔍☆441Updated last month
- Utility for behavioral and representational analyses of Language Models☆163Updated 3 weeks ago
- ☆55Updated 3 years ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback☆97Updated 2 years ago
- Benchmarking Large Language Models☆99Updated 3 months ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆403Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages.☆32Updated 7 months ago
- Data for evaluating gender bias in coreference resolution systems.☆80Updated 6 years ago
- StereoSet: Measuring stereotypical bias in pretrained language models☆191Updated 2 years ago
- GEMBA — GPT Estimation Metric Based Assessment☆128Updated last year