BatsResearch / LexC-GenLinks
Generate synthetic labeled data for extremely low-resource languages using bilingual lexicons.
☆18Updated 11 months ago
Alternatives and similar repositories for LexC-Gen
Users that are interested in LexC-Gen are comparing it to the libraries listed below
Sorting:
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆88Updated 7 months ago
- Resources for the "SummEval: Re-evaluating Summarization Evaluation" paper☆403Updated last year
- Resources for cultural NLP research☆103Updated 4 months ago
- ☆100Updated last year
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆28Updated last year
- ☆17Updated 2 years ago
- ☆54Updated 3 years ago
- A reading list of up-to-date papers on NLP for Social Good.☆304Updated 2 years ago
- Utility for behavioral and representational analyses of Language Models☆160Updated last month
- ☆37Updated 11 months ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Updated 3 years ago
- ☆218Updated last month
- Find informative examples to efficiently (human)-evaluate NLG models.☆16Updated last month
- ☆171Updated 6 years ago
- A Multilingual Replicable Instruction-Following Model☆95Updated 2 years ago
- Interpretability for sequence generation models 🐛 🔍☆438Updated this week
- OpenNyAI is a mission aimed at developing open source software and datasets to catalyze the creation of AI-powered solutions to improve a…☆41Updated last year
- A repository with several curated datasets of counter-narratives to fight online hate speech.☆90Updated 2 months ago
- Repository for XLM-T, a framework for evaluating multilingual language models on Twitter data☆158Updated 2 years ago
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆29Updated 3 years ago
- ☆110Updated 9 months ago
- Benchmarking Large Language Models☆99Updated 3 months ago
- ☆111Updated last year
- A collection of text simplification datasets and other resources☆48Updated last year
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆70Updated last year
- A curated list of research papers and resources on Cultural LLM.☆48Updated 11 months ago
- ☆45Updated 3 years ago
- TyDi QA contains 200k human-annotated question-answer pairs in 11 Typologically Diverse languages, written without seeing the answer and …☆313Updated 5 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆104Updated last year
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆93Updated last year