BatsResearch / LexC-Gen
Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.
☆15 · Updated 6 months ago
Alternatives and similar repositories for LexC-Gen:
Users who are interested in LexC-Gen are comparing it to the libraries listed below.
- BLOOM+1: Adapting BLOOM model to support a new unseen language ☆71 · Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023 ☆100 · Updated last year
- ☆26 · Updated 4 months ago
- A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation. ☆52 · Updated 3 weeks ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer ☆55 · Updated 2 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists" ☆48 · Updated 2 years ago
- German Alpaca Dataset (Cleaned + Translated) ☆24 · Updated 2 years ago
- ☆34 · Updated 10 months ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models" ☆30 · Updated 6 months ago
- Okapi: Instruction-tuned Large Language Models in Multiple Languages with Reinforcement Learning from Human Feedback ☆94 · Updated last year
- ☆44 · Updated 2 years ago
- Can LLMs generate code-mixed sentences through zero-shot prompting? ☆11 · Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation" ☆30 · Updated 3 years ago
- ☆14 · Updated last year
- Find informative examples to efficiently (human)-evaluate NLG models. ☆10 · Updated last month
- A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings. ☆71 · Updated 8 months ago
- A curated list of research papers and resources on Cultural LLM. ☆42 · Updated 7 months ago
- ☆65 · Updated last year
- Crosslingual Question Answering for African Languages ☆29 · Updated 7 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023 ☆68 · Updated last year
- Software for transferring pre-trained English models to foreign languages