google-research-datasets / seegullLinks

SeeGULL is a broad-coverage stereotype dataset in English containing stereotypes about identity groups spanning 178 countries across 8 different geo-political regions across 6 continents, as well as state-level identities within the US and India.

☆37

Alternatives and similar repositories for seegull

Users that are interested in seegull are comparing it to the libraries listed below

Sorting:

facebookresearch / ResponsibleNLP
Repository for research in the field of Responsible NLP at Meta.
☆202Updated 6 months ago
simran-khanuja / awesome-cultural-nlp
Resources for cultural NLP research
☆107Updated last month
huggingface / that_is_good_data
☆65Updated 2 years ago
microsoft / Multilingual-Evaluation-of-Generative-AI-MEGA
Code for Multilingual Eval of Generative AI paper published at EMNLP 2023
☆70Updated last year
mainlp / awesome-human-label-variation
A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …
☆94Updated last year
dmg-illc / JUDGE-BENCH
☆36Updated 3 months ago
faridlazuarda / cultural-llm-papers
A curated list of research papers and resources on Cultural LLM.
☆52Updated last year
moinnadeem / StereoSet
StereoSet: Measuring stereotypical bias in pretrained language models
☆192Updated 2 years ago
mega002 / lm-debugger
The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.
☆180Updated 3 years ago
MilaNLProc / honest
A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.
☆20Updated 7 months ago
kayoyin / interpret-lm
Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)
☆62Updated 3 years ago
inseq-team / inseq
Interpretability for sequence generation models 🐛 🔍
☆445Updated 3 weeks ago
zhijing-jin / NLP4SocialGood_Papers
A reading list of up-to-date papers on NLP for Social Good.
☆304Updated 2 years ago
cardiffnlp / timelms
TimeLMs: Diachronic Language Models from Twitter
☆111Updated last year
MaLA-LM / GlotEval
GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way
☆14Updated 2 weeks ago
chaitanyamalaviya / ExpertQA
[Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers
☆135Updated last year
allenai / wimbd
What's In My Big Data (WIMBD) - a toolkit for analyzing large text datasets
☆223Updated last year
claws-lab / XLingEval
Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"
☆19Updated last year
HrishikeshVish / Fairpy
☆25Updated last year
g8a9 / ferret
A python package for benchmarking interpretability techniques on Transformers.
☆212Updated last year
rudinger / winogender-schemas
Data for evaluating gender bias in coreference resolution systems.
☆81Updated 6 years ago
dreji18 / Fairness-in-AI
Detecting Bias and ensuring Fairness in AI solutions
☆102Updated 2 years ago
McGill-NLP / bias-bench
ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
☆150Updated 3 months ago
mt-upc / transformer-contributions
Measuring the Mixing of Contextual Information in the Transformer
☆32Updated 2 years ago
cambridgeltl / composable-sft
A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆75Updated last year
bminixhofer / zett
Code for Zero-Shot Tokenizer Transfer
☆141Updated 10 months ago
timoschick / self-debiasing
This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".
☆88Updated 4 years ago
aviaefrat / lmentry
☆14Updated 2 years ago
ZurichNLP / mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
☆60Updated last year
SALT-NLP / implicit-hate
☆41Updated 2 years ago