google-research-datasets / seegullLinks
SeeGULL is a broad-coverage stereotype dataset in English containing stereotypes about identity groups spanning 178 countries across 8 different geo-political regions across 6 continents, as well as state-level identities within the US and India.
☆37Updated 2 years ago
Alternatives and similar repositories for seegull
Users that are interested in seegull are comparing it to the libraries listed below
Sorting:
- Resources for cultural NLP research☆112Updated 2 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆71Updated last year
- Repository for research in the field of Responsible NLP at Meta.☆204Updated 7 months ago
- ☆65Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆52Updated last year
- Benchmarking Large Language Models☆104Updated 5 months ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆19Updated last year
- FBI: Finding Blindspots in LLM Evaluations with Interpretable Checklists☆30Updated 4 months ago
- What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets☆224Updated last year
- ☆17Updated 2 years ago
- Interpretability for sequence generation models 🐛 🔍☆449Updated 2 weeks ago
- StereoSet: Measuring stereotypical bias in pretrained language models☆194Updated 3 years ago
- Detecting Bias and ensuring Fairness in AI solutions☆102Updated 2 years ago
- A collaborative project to collect datasets in SEA languages, SEA regions, or SEA cultures.☆93Updated 10 months ago
- Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.☆121Updated 2 months ago
- A reading list of up-to-date papers on NLP for Social Good.☆304Updated 2 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆60Updated last year
- Efficiently find the best-suited language model (LM) for your NLP task☆132Updated 4 months ago
- [EMNLP'23] Official Code for "FOCUS: Effective Embedding Initialization for Monolingual Specialization of Multilingual Models"☆34Updated 6 months ago
- Measuring the Mixing of Contextual Information in the Transformer☆33Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆106Updated last year
- Code for WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.☆85Updated last year
- An Apache 2.0 fork of HuggingFace's Large Language Model Text Generation Inference☆19Updated last year
- ☆37Updated 4 months ago
- Crosslingual Question Answering for African Languages☆30Updated last year
- ☆25Updated last year
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- ☆118Updated last year
- A Multilingual Replicable Instruction-Following Model☆95Updated 2 years ago
- A python package for benchmarking interpretability techniques on Transformers.☆214Updated last year