cicl-stanford / procedural-evals-tom
☆25Updated last year
Related projects:
- ☆44Updated 8 months ago
- Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks☆19Updated last year
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.☆26Updated 2 months ago
- ☆25Updated 9 months ago
- ☆10Updated 4 months ago
- ☆46Updated 10 months ago
- Official code for paper Understanding the Reasoning Ability of Language Models From the Perspective of Reasoning Paths Aggregation☆13Updated 6 months ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆41Updated 9 months ago
- Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision☆78Updated last week
- Source codes for "Preference-grounded Token-level Guidance for Language Model Fine-tuning" (NeurIPS 2023).☆12Updated 10 months ago
- ☆25Updated 7 months ago
- ☆70Updated 10 months ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆49Updated 3 months ago
- ☆30Updated 7 months ago
- [ACL 2024 Findings] CriticBench: Benchmarking LLMs for Critique-Correct Reasoning☆20Updated 6 months ago
- Directional Preference Alignment☆44Updated 3 months ago
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆24Updated 2 months ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆17Updated last year
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆96Updated 7 months ago
- Repository (preliminary codes) for DSTC10 SIMMC track.☆19Updated last year
- Self-Explore to avoid the pit! Improving the Reasoning Capabilities of Language Models with Fine-grained Rewards☆39Updated 4 months ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆11Updated 2 months ago
- Data and code accompanying the paper "Reasoning about Goals, Steps, and Temporal Ordering with WikiHow"☆31Updated last year
- Domain-specific preference (DSP) data and customized RM fine-tuning.☆24Updated 6 months ago
- ☆36Updated 5 months ago
- This is code for most of the experiments in the paper Understanding the Effects of RLHF on LLM Generalisation and Diversity☆35Updated 8 months ago
- ☆11Updated last year
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models☆33Updated 9 months ago
- ☆61Updated 3 months ago
- Analyzing LLM Alignment via Token distribution shift☆13Updated 7 months ago