BatsResearch / LexC-Gen
Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.
☆18 Updated last year
Alternatives and similar repositories for LexC-Gen
Users who are interested in LexC-Gen are comparing it to the libraries listed below.
- ☆17 Updated 3 years ago
- A reading list of up-to-date papers on NLP for Social Good. ☆305 Updated 2 years ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆72 Updated last year
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures. ☆94 Updated last year
- Utility for behavioral and representational analyses of Language Models ☆177 Updated last week
- ☆263 Updated 6 months ago
- Resources for cultural NLP research ☆113 Updated 4 months ago
- Interpretability for sequence generation models 🐛 🔍 ☆453 Updated last week
- Repository for research in the field of Responsible NLP at Meta. ☆205 Updated last week
- The FLORES+ Machine Translation Benchmark ☆110 Updated last year
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features. ☆28 Updated last year
- A repository with several curated datasets of counter-narratives to fight online hate speech. ☆94 Updated 6 months ago
- A Multilingual Replicable Instruction-Following Model ☆96 Updated 2 years ago
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021. ☆20 Updated 10 months ago
- Crosslingual Reasoning through Test-Time Scaling ☆20 Updated 8 months ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023 ☆186 Updated 2 months ago
- ☆45 Updated 3 years ago
- TimeLMs: Diachronic Language Models from Twitter ☆112 Updated last year
- AfroLID, a powerful neural toolkit for African languages identification which covers 517 African languages. ☆35 Updated last week
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper ☆410 Updated last year
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, … ☆98 Updated last year
- Find informative examples to efficiently (human)-evaluate NLG models. ☆18 Updated this week
- ☆24 Updated 4 years ago
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models. ☆87 Updated last year
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆96 Updated 2 years ago
- Multilingual Large Language Models Evaluation Benchmark ☆133 Updated last year
- Jojajovai Guarani-Spanish Parallel Corpus ☆18 Updated 3 years ago
- Some notebooks for NLP ☆207 Updated 2 years ago
- Data for evaluating gender bias in coreference resolution systems. ☆81 Updated 6 years ago
- ☆117 Updated 3 months ago