Walter0807 / RepBelief
[ICML 2024] Language Models Represent Beliefs of Self and Others
☆32 Updated 6 months ago
Alternatives and similar repositories for RepBelief:
Users who are interested in RepBelief are comparing it to the repositories listed below
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆62 Updated 11 months ago
- [ACL 2024] Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language Models ☆19 Updated 9 months ago
- ☆44 Updated 5 months ago
- Directional Preference Alignment ☆57 Updated 7 months ago
- Self-Supervised Alignment with Mutual Information ☆17 Updated 11 months ago
- ☆31 Updated last year
- ☆22 Updated 9 months ago
- ☆18 Updated 5 months ago
- Domain-specific preference (DSP) data and customized RM fine-tuning. ☆25 Updated last year
- ☆33 Updated last month
- ☆14 Updated 2 months ago
- ☆30 Updated last year
- Code for "Seeking Neural Nuggets: Knowledge Transfer in Large Language Models from a Parametric Perspective" ☆32 Updated 11 months ago
- ☆16 Updated 7 months ago
- Official implementation of Bootstrapping Language Models via DPO Implicit Rewards ☆43 Updated last week
- ☆13 Updated 9 months ago
- ☆127 Updated 9 months ago
- ☆25 Updated 11 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity ☆43 Updated last year
- ☆40 Updated 5 months ago
- Evaluate the Quality of Critique ☆34 Updated 10 months ago
- ☆21 Updated 9 months ago
- Search, Verify and Feedback: Towards Next Generation Post-training Paradigm of Foundation Models via Verifier Engineering ☆57 Updated 4 months ago
- A Survey on the Honesty of Large Language Models ☆57 Updated 4 months ago
- ☆40 Updated last year
- ☆18 Updated 11 months ago
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆38 Updated last year
- A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models ☆19 Updated 5 months ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models ☆52 Updated last year
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction" ☆26 Updated 3 weeks ago