jianggy / MPI
This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models
☆49Updated last year
Alternatives and similar repositories for MPI:
Users that are interested in MPI are comparing it to the libraries listed below
- ☆27Updated last year
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆115Updated this week
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆57Updated 9 months ago
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.☆37Updated 7 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆115Updated 5 months ago
- Personalized Soups: Personalized Large Language Model Alignment via Post-hoc Parameter Merging☆99Updated last year
- A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.☆62Updated 3 months ago
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆31Updated 4 months ago
- Inspecting and Editing Knowledge Representations in Language Models☆112Updated last year
- [EMNLP 2023, Findings] GRACE: Discriminator-Guided Chain-of-Thought Reasoning☆47Updated 4 months ago
- ☆30Updated last year
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆108Updated last year
- Directional Preference Alignment☆56Updated 4 months ago
- ☆34Updated last year
- ☆89Updated last year
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆22Updated 11 months ago
- ☆20Updated last year
- [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆47Updated 2 months ago
- ☆44Updated 6 months ago
- ☆41Updated 3 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions☆106Updated 5 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆50Updated 10 months ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆22Updated 7 months ago
- Lightweight Adapting for Black-Box Large Language Models☆19Updated last year
- Public code repo for paper "Aligning LLMs with Individual Preferences via Interaction"☆18Updated 4 months ago
- The repository of the project "Fine-tuning Large Language Models with Sequential Instructions", code base comes from open-instruct and LA…☆29Updated 2 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models☆53Updated 10 months ago
- [EMNLP Findings 2024 & ACL 2024 NLRSE Oral] Enhancing Mathematical Reasoning in Language Models with Fine-grained Rewards☆48Updated 9 months ago
- ☆24Updated 9 months ago
- ☆49Updated last year