locross93 / Hypothetical-Minds
Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind module that scaffolds the high-level planning process by generating, evaluating, and refining hypotheses about other agents’ strategies in natural language.
☆17Updated 3 months ago
Related projects ⓘ
Alternatives and complementary repositories for Hypothetical-Minds
- Repo to reproduce the First-Explore paper results☆36Updated last week
- Intelligent Go-Explore: Standing on the Shoulders of Giant Foundation Models☆45Updated 5 months ago
- ☆25Updated last week
- PyTorch Implementation of the paper "Towards Learning Abductive Reasoning using VSA Distributed Representations".☆12Updated 2 months ago
- Code for the "Cultural evolution in populations of Large Language Models" paper☆28Updated last week
- We develop world models that can be adapted with natural language. Intergrating these models into artificial agents allows humans to effe…☆19Updated 9 months ago
- [NeurIPS 2024] GTBench: Uncovering the Strategic Reasoning Limitations of LLMs via Game-Theoretic Evaluations☆49Updated 2 months ago
- Documentation for dynamic machine learning systems.☆27Updated last month
- [ICLR 2024] Trajectory-as-Exemplar Prompting with Memory for Computer Control☆49Updated 2 months ago
- Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"☆11Updated last week
- Generative cellular automaton-like learning environments for RL.☆19Updated last month
- Arrakis is a library to conduct, track and visualize mechanistic interpretability experiments.☆20Updated 3 months ago
- A Large Recurrent Action Model: xLSTM enables Fast Inference for Robotics Tasks☆16Updated last week
- Minimum Description Length probing for neural network representations☆16Updated last week
- OMNI-EPIC: Open-endedness via Models of human Notions of Interestingness with Environments Programmed in Code☆26Updated 2 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆84Updated 3 months ago
- ☆12Updated 3 months ago
- Automatic Integration for Neural Spatio-Temporal Point Process models (AI-STPP) is a new paradigm for exact, efficient, non-parametric inf…☆24Updated 3 weeks ago
- An OpenAI gym environment to evaluate the ability of LLMs (eg. GPT-4, Claude) in long-horizon reasoning and task planning in dynamic mult…☆62Updated last year
- Q-Probe: A Lightweight Approach to Reward Maximization for Language Models☆37Updated 4 months ago
- Explainable Reinforcement Learning (XRL) Resources☆33Updated last month
- ☆48Updated 4 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆13Updated last week
- A benchmark for evaluating learning agents based on just language feedback☆56Updated last month
- ☆73Updated 4 months ago
- This code accompanies the paper "Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration."☆22Updated 2 weeks ago
- ☆17Updated 4 months ago
- Clean RL implementation using MLX☆25Updated 8 months ago
- Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"☆31Updated last year
- ☆36Updated 3 months ago