JunsolKim / RepresentationPoliticalLLM
Kim, J., Evans, J., & Schein, A. (2025). Linear Representations of Political Perspective Emerge in Large Language Models. ICLR.
☆11Updated 3 weeks ago
Alternatives and similar repositories for RepresentationPoliticalLLM:
Users that are interested in RepresentationPoliticalLLM are comparing it to the libraries listed below
- ☆23Updated 2 years ago
- ☆22Updated last year
- ☆57Updated 4 months ago
- Code for "From Pretraining Data to Language Models to Downstream Tasks: Tracking the Trails of Political Biases Leading to Unfair NLP Mod…☆36Updated last year
- ☆106Updated 11 months ago
- Code and data for Marked Personas (ACL 2023)☆23Updated last year
- ☆16Updated 3 months ago
- ☆49Updated last year
- ☆131Updated last year
- Repository for the Bias Benchmark for QA dataset.☆113Updated last year
- ☆37Updated 5 months ago
- The Prism Alignment Project☆75Updated last year
- ☆18Updated 3 years ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆137Updated 4 months ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆80Updated 4 years ago
- ☆47Updated 3 years ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆31Updated 4 months ago
- ☆39Updated last year
- The official repo for SocKET: Social Knowledge Evaluation Tests☆23Updated last year
- Code and data for the EMNLP 2021 paper "Just Say No: Analyzing the Stance of Neural Dialogue Generation in Offensive Contexts". Coming so…☆17Updated last year
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- ☆11Updated 2 years ago
- ☆36Updated 2 years ago
- [NLP] Unsupervised User Stance Detection on Twitter.☆16Updated 2 years ago
- This repository provides details and links to the ACL anthology corpus/collection including .bib, .pdf and grobid extractions of the pdfs☆179Updated last year
- Resources for cultural NLP research☆92Updated this week
- Code and data for Koo et al's ACL 2024 paper "Benchmarking Cognitive Biases in Large Language Models as Evaluators"☆20Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated last year
- Code/data for MARG (multi-agent review generation)☆42Updated 5 months ago
- Code and data for the paper "Measuring Conversational Uptake: A Case-Study on Student-Teacher Interactions"☆24Updated 2 years ago