msclar / symmtom
Code for the paper "Symmetric Machine Theory of Mind", presented at ICML 2022.
☆11 Updated 2 years ago
Alternatives and similar repositories for symmtom
Users who are interested in symmtom are comparing it to the libraries listed below:
- Implements the Messenger environment and EMMA model. ☆23 Updated last year
- A PyTorch Implementation of Skipper ☆20 Updated 4 months ago
- The Implementation of "Machine Theory of Mind", ICML 2018 ☆22 Updated 2 years ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆19 Updated 3 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation) ☆65 Updated 3 years ago
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments) ☆12 Updated 3 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆102 Updated 2 years ago
- Code and data for Learning Rewards from Linguistic Feedback, AAAI '21 ☆10 Updated 4 years ago
- Change-Based Exploration Transfer ☆36 Updated 2 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation ☆26 Updated last year
- Using Natural Language for Reward Shaping in Reinforcement Learning ☆23 Updated last year
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games ☆14 Updated 2 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 9 months ago
- Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020) ☆39 Updated 4 years ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆54 Updated 7 months ago
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options ☆44 Updated 3 years ago
- Data synthesis code for "AGENT: A Benchmark for Core Psychological Reasoning" ☆22 Updated 2 years ago
- Super fast implementations of common benchmark text world games ☆45 Updated 2 months ago
- Code repository for the NAACL 2025 paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language… ☆28 Updated 4 months ago
- Knowledge-Aware RL agents with Commonsense Reasoning ☆75 Updated 2 years ago
- PyTorch implementation for "On the Critical Role of Conventions in Adaptive Human-AI Collaboration", ICLR 2021 ☆16 Updated 3 years ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022]. ☆15 Updated 2 years ago