BatsResearch / LexC-Gen
Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.
☆15Updated 7 months ago
Alternatives and similar repositories for LexC-Gen
Users that are interested in LexC-Gen are comparing it to the libraries listed below
Sorting:
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆58Updated 11 months ago
- ☆34Updated 10 months ago
- A system for prompted weak supervision. Alfred is a powerful tool that leverages large language models to accelerate data annotation.☆54Updated last month
- ☆26Updated 5 months ago
- ☆65Updated last year
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆100Updated last year
- BLOOM+1: Adapting BLOOM model to support a new unseen language☆71Updated last year
- A Multilingual Replicable Instruction-Following Model☆93Updated last year
- ☆97Updated last year
- A curated list of research papers and resources on Cultural LLM.☆43Updated 7 months ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆23Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆128Updated last year
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆55Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- Crosslingual Reasoning through Test-Time Scaling☆14Updated this week
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆48Updated 2 years ago
- AIS is an evaluation framework for assessing whether the output of natural language models only contains information about the external w…☆31Updated 2 years ago
- ☆14Updated last year
- Apps built using Inspired Cognition's Critique.☆58Updated 2 years ago
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020.☆50Updated 4 years ago
- The geometry of multilingual language model representations (EMNLP 2022).☆20Updated 2 years ago
- How do transformer LMs encode relations?☆48Updated last year
- ☆16Updated 3 years ago
- ☆48Updated 2 years ago
- Code for ACL 2022 paper "Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation"☆30Updated 3 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆43Updated 9 months ago
- Public repository for SemEval 2023 - Task 10 - Explainable Detection of Online Sexism (EDOS)☆22Updated 2 years ago
- Statistics on multilingual datasets☆17Updated 2 years ago
- A corpus and code for understanding norms and subjectivity. 🤖☆49Updated 7 months ago
- UDapter is a multilingual dependency parser that uses "contextual" adapters together with language-typology features for language-specifi…☆31Updated 2 years ago