BunsenFeng / modular_pluralism
Modular Pluralism @ EMNLP 2024
☆18Updated 9 months ago
Alternatives and similar repositories for modular_pluralism
Users who are interested in modular_pluralism are comparing it to the repositories listed below
- ☆51Updated last year
- ☆42Updated last year
- Official repository for paper "High-Dimension Human Value Representation in Large Language Models" (NAACL'25 Main)☆23Updated 11 months ago
- ☆40Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆73Updated 3 months ago
- ☆29Updated last year
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆113Updated 9 months ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆52Updated last year
- ☆95Updated last year
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975☆38Updated last year
- ☆32Updated last year
- ☆44Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆112Updated last year
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆60Updated 7 months ago
- ☆18Updated last year
- ☆25Updated 2 weeks ago
- Function Vectors in Large Language Models (ICLR 2024)☆170Updated 2 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated last year
- ☆172Updated last year
- Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)☆59Updated last year
- ☆44Updated 3 months ago
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction"☆29Updated 2 months ago
- ☆22Updated last year
- Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".☆78Updated last year
- Official code for "Decoding-Time Language Model Alignment with Multiple Objectives".☆25Updated 7 months ago
- ☆62Updated 2 years ago
- Augmenting Statistical Models with Natural Language Parameters☆27Updated 9 months ago
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings)☆22Updated 3 years ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆108Updated last year
- ☆74Updated last year