marcelbinz / CENTaUR
☆25Updated 11 months ago
Alternatives and similar repositories for CENTaUR
Users that are interested in CENTaUR are comparing it to the libraries listed below
Sorting:
- ☆43Updated 2 weeks ago
- ☆18Updated 9 months ago
- ☆29Updated 3 weeks ago
- ☆132Updated 6 months ago
- Dataset and benchmark for assessing LLMs in translating natural language descriptions of planning problems into PDDL☆48Updated 7 months ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated last year
- The Prism Alignment Project☆75Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆75Updated last year
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆52Updated last year
- Interpretable text embeddings by asking LLMs yes/no questions (NeurIPS 2024)☆36Updated 6 months ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆29Updated 10 months ago
- maze datasets for investigating OOD behavior of ML systems☆44Updated 2 weeks ago
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆57Updated 2 months ago
- Evaluating the Moral Beliefs Encoded in LLMs☆26Updated 4 months ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 6 months ago
- General-Sum variant of the game Diplomacy for evaluating AIs.☆28Updated last year
- ☆92Updated 10 months ago
- We develop benchmarks and analysis tools to evaluate the causal reasoning abilities of LLMs.☆117Updated 11 months ago
- ☆69Updated last year
- ☆36Updated 7 months ago
- ☆94Updated 3 months ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆55Updated 11 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆146Updated 3 months ago
- Data synthesis code for "AGENT: A Benchmark for Core Psychological Reasoning"☆22Updated 3 years ago
- Probabilistic programming with large language models☆116Updated 3 weeks ago
- Language of thought library for python 3☆48Updated last year
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆129Updated 2 months ago
- Super fast implementations of common benchmark text world games☆47Updated 2 months ago
- ☆114Updated 9 months ago
- tomsup 👍 Theory of Mind Simulation using Python. A package that allows for easy agent-based modelling of recursive Theory of Mind☆68Updated last year