google-research-datasets / seegull
SeeGULL is a broad-coverage English-language stereotype dataset. It contains stereotypes about identity groups from 178 countries in 8 geo-political regions on 6 continents, as well as state-level identities within the US and India.
☆36Updated last year
Alternatives and similar repositories for seegull
Users who are interested in seegull are comparing it to the libraries listed below
- Repository for research in the field of Responsible NLP at Meta.☆202Updated 2 months ago
- Code for Multilingual Eval of Generative AI paper published at EMNLP 2023☆70Updated last year
- Resources for cultural NLP research☆101Updated 3 months ago
- ☆66Updated 2 years ago
- Code and Resources for the paper, "Better to Ask in English: Cross-Lingual Evaluation of Large Language Models for Healthcare Queries"☆16Updated last year
- StereoSet: Measuring stereotypical bias in pretrained language models☆186Updated 2 years ago
- TimeLMs: Diachronic Language Models from Twitter☆109Updated last year
- A Python package to compute HONEST, a score to measure hurtful sentence completions in language models. Published at NAACL 2021.☆21Updated 4 months ago
- ☆41Updated 2 years ago
- A curated list of research papers and resources on Cultural LLM.☆46Updated 10 months ago
- Ensembling Hugging Face transformers made easy☆63Updated 2 years ago
- 💬 Language Identification with Support for More Than 2000 Labels -- EMNLP 2023☆147Updated 2 months ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆103Updated last year
- Code for our WOAH@ACL 2021 Paper on Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in …☆29Updated 3 years ago
- ☆217Updated last week
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way☆14Updated 3 weeks ago
- Collection of NLP model explanations and accompanying analysis tools☆144Updated 2 years ago
- Repository for the Dynamically Generated Hate Speech Dataset by Vidgen et al. (2021).☆43Updated 2 months ago
- Detecting Bias and ensuring Fairness in AI solutions☆98Updated 2 years ago
- ☆104Updated 7 months ago
- This repository contains the HiNER dataset released with our paper at LREC 2022☆15Updated 2 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆14Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆58Updated last year
- [Data + code] ExpertQA : Expert-Curated Questions and Attributed Answers☆131Updated last year
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆143Updated 7 months ago
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.☆31Updated 2 years ago
- Data for evaluating gender bias in coreference resolution systems.☆79Updated 6 years ago
- A Python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆28Updated 10 months ago
- To analyze and remove gender bias in coreference resolution systems☆79Updated 3 months ago
- Attribute statements generated by LLMs to preceding tokens using attention weights.☆17Updated 3 months ago