sileod / llm-theory-of-mindLinks
Testing Theory of Mind (ToM) in language models with epistemic logic
☆17Updated last year
Alternatives and similar repositories for llm-theory-of-mind
Users that are interested in llm-theory-of-mind are comparing it to the libraries listed below
Sorting:
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆55Updated last year
- Code for the ICLR 2024 paper "How to catch an AI liar: Lie detection in black-box LLMs by asking unrelated questions"☆72Updated last year
- ☆36Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model☆44Updated last year
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated 2 years ago
- ☆35Updated last month
- Inspecting and Editing Knowledge Representations in Language Models☆116Updated 2 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆86Updated last year
- ☆24Updated last year
- [NeurIPS 2023] PyTorch code for Can Language Models Teach? Teacher Explanations Improve Student Performance via Theory of Mind☆66Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆109Updated 9 months ago
- Source code and data for The Magic of IF: Investigating Causal Reasoning Abilities in Large Language Models of Code (Findings of ACL 2023…☆29Updated 2 years ago
- ☆42Updated 6 months ago
- ☆26Updated 2 years ago
- Code of ICLR paper: https://openreview.net/forum?id=-cqvvvb-NkI☆94Updated 2 years ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆12Updated 2 years ago
- About The corresponding code from our paper " REFINER: Reasoning Feedback on Intermediate Representations" (EACL 2024). Do not hesitate t…☆70Updated last year
- A unified benchmark for math reasoning☆88Updated 2 years ago
- PASTA: Post-hoc Attention Steering for LLMs☆122Updated 9 months ago
- Neuro-Symbolic Integration Brings Causal and Reliable Reasoning Proofs☆38Updated last year
- ☆107Updated 6 months ago
- ☆27Updated last year
- A repository for transformer critique learning and generation☆90Updated last year
- ☆73Updated last year
- ☆22Updated 3 weeks ago
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023)☆64Updated last year
- Based on the tree of thoughts paper☆48Updated 2 years ago
- ☆54Updated 2 years ago
- Few-shot Learning with Auxiliary Data☆31Updated last year
- Learning to route instances for Human vs AI Feedback (ACL 2025 Main)☆23Updated last month