myracheng / lm_caricatureLinks
code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations
☆10Updated last year
Alternatives and similar repositories for lm_caricature
Users that are interested in lm_caricature are comparing it to the libraries listed below
Sorting:
- This repository contains data, code and models for contextual noncompliance.☆22Updated 10 months ago
- ☆78Updated 2 years ago
- ☆54Updated 2 weeks ago
- ☆29Updated last year
- Teaching Models to Express Their Uncertainty in Words☆39Updated 3 years ago
- code for EMNLP 2024 paper: How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for M…☆11Updated 6 months ago
- [ICLR 2025] Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization☆28Updated 4 months ago
- Restore safety in fine-tuned language models through task arithmetic☆28Updated last year
- Data Valuation on In-Context Examples (ACL23)☆23Updated 4 months ago
- ☆50Updated last year
- ☆29Updated last year
- ☆24Updated 8 months ago
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective"☆32Updated last year
- ☆34Updated 3 years ago
- This is the oficial repository for "Parameter-Efficient Multi-task Tuning via Attentional Mixtures of Soft Prompts" (EMNLP 2022)☆101Updated 2 years ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆61Updated 2 years ago
- ☆44Updated 9 months ago
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆25Updated 9 months ago
- Align your LM to express calibrated verbal statements of confidence in its long-form generations.☆25Updated last year
- Providing the answer to "How to do patching on all available SAEs on GPT-2?". It is an official repository of the implementation of the p…☆11Updated 4 months ago
- Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments (Zhou et al., EMNLP 2024)☆13Updated 8 months ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆60Updated 2 years ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆38Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- "Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)☆86Updated last year
- A codebase for ACL 2023 paper: Mitigating Label Biases for In-context Learning☆10Updated last year
- [ACL 2020] Towards Debiasing Sentence Representations☆66Updated 2 years ago
- AbstainQA, ACL 2024☆25Updated 7 months ago
- A framework to train language models to learn invariant representations.☆14Updated 3 years ago