chrisvdweth / selene
An open, large-scale, interactive textbook.
☆47 Updated last week
Alternatives and similar repositories for selene
Users who are interested in selene are comparing it to the repositories listed below.
- Papers on fairness in NLP ☆449 Updated last year
- ☆51 Updated last year
- ☆152 Updated 2 years ago
- A resource repository for representation engineering in large language models ☆138 Updated 11 months ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models. ☆149 Updated 2 months ago
- A reading list for papers on causality for natural language processing (NLP) ☆669 Updated 4 months ago
- Aligning AI With Shared Human Values (ICLR 2021) ☆303 Updated 2 years ago
- Training data extraction on GPT-2 ☆193 Updated 2 years ago
- ☆21 Updated last month
- Materials for EACL2024 tutorial: Transformer-specific Interpretability ☆60 Updated last year
- Python package for measuring memorization in LLMs. ☆168 Updated 3 months ago
- ☆164 Updated 10 months ago
- This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske… ☆125 Updated last year
- StereoSet: Measuring stereotypical bias in pretrained language models ☆191 Updated 2 years ago
- ☆25 Updated 4 months ago
- ICLR 2025 Workshop & CHI 2025 SIG: "Bidirectional Human-AI Alignment" ☆42 Updated last year
- ☆31 Updated 4 years ago
- ☆63 Updated last year
- ☆18 Updated 4 years ago
- A framework for assessing and improving classification fairness. ☆33 Updated 2 years ago
- [Preprint' 24] LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs ☆11 Updated last year
- ☆293 Updated 2 months ago
- "Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper) ☆88 Updated 2 years ago
- Resources for cultural NLP research ☆104 Updated 3 weeks ago
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings) ☆21 Updated 4 years ago
- ☆56 Updated 2 years ago
- A reading list of up-to-date papers on NLP for Social Good. ☆305 Updated 2 years ago
- A toolkit to assess data privacy in LLMs (under development) ☆62 Updated 9 months ago
- ☆116 Updated last year
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper ☆80 Updated 4 years ago