valentinhofmann / dialect-prejudice
☆31Updated last month
Related projects ⓘ
Alternatives and complementary repositories for dialect-prejudice
- ☆196Updated 2 weeks ago
- ☆94Updated 6 months ago
- ☆20Updated last year
- Repository for the ACL 2024 conference website☆17Updated last month
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆62Updated last year
- The Prism Alignment Project☆37Updated 6 months ago
- 👩💻 Code for the ACL paper "Detecting Edit Failures in LLMs: An Improved Specificity Benchmark"☆20Updated 9 months ago
- ☆26Updated last month
- PAIR.withgoogle.com and friend's work on interpretability methods☆148Updated 2 weeks ago
- ☆74Updated last month
- Röttger et al. (2023): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models"☆61Updated 10 months ago
- ☆86Updated 5 months ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆29Updated 8 months ago
- ☆70Updated 3 months ago
- ☆29Updated last year
- ☆109Updated last year
- A Benchmark of PDF Information Extraction Tools using a Multi-Task and Multi-Domain Evaluation Framework for Academic Documents☆19Updated last year
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆29Updated 8 months ago
- Repository for the Bias Benchmark for QA dataset.☆85Updated 10 months ago
- Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM (CHI 2024 paper). LLooM automatically surfaces high-l…☆58Updated last week
- LLM experiments done during SERI MATS - focusing on activation steering / interpreting activation spaces☆78Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated last year
- Edu-ConvoKit: An Open-Source Framework for Education Conversation Data☆77Updated 3 months ago
- ☆21Updated 8 months ago
- This repository hosts the paper “LLM Based Math Tutoring: Challenges and Dataset”, along with the accompanying dataset. It explores the p…☆28Updated 2 months ago
- Official implementation of the ACL 2024: Scientific Inspiration Machines Optimized for Novelty☆68Updated 7 months ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆65Updated 3 years ago
- ☆32Updated last year
- A list of ethics related resources for researchers and practitioners of Natural Language Processing and Computational Linguistics☆30Updated 10 months ago
- ☆73Updated 4 months ago