msclar / symmtom
Code for the paper "Symmetric Machine Theory of Mind", presented at ICML 2022.
☆12Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for symmtom
- Implements the Messenger environment and EMMA model.☆23Updated last year
- Implementation of TWOSOME☆49Updated 7 months ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆13Updated 2 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆101Updated 2 years ago
- A reinforcement learning environment for the IGLU 2022 at NeurIPS☆32Updated last year
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)☆12Updated 3 years ago
- Implementation of the Off Belief Learning algorithm.☆45Updated 2 years ago
- ☆24Updated 2 years ago
- ☆74Updated 4 months ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆19Updated 3 years ago
- ☆11Updated 7 months ago
- We perform functional grounding of LLMs' knowledge in BabyAI-Text☆221Updated 3 months ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022].☆15Updated 2 years ago
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆42Updated 2 years ago
- Object Centric Atari games☆48Updated this week
- The Implementation of "Machine Theory of Mind", ICML 2018☆21Updated 2 years ago
- ☆35Updated 4 months ago
- Dataset collection and training code for "Ask Your Humans: Using Human Instructions to Improve Generalization in Reinforcement Learning"☆9Updated 3 years ago
- ☆18Updated last year
- Code for LaMPP: Language Models as Probabilistic Priors for Perception and Action☆35Updated last year
- Code and data for Learning Rewards from Linguistic Feedback, AAAI '21☆10Updated 3 years ago
- ☆19Updated 2 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆25Updated last year
- ☆19Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆26Updated 2 months ago
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆116Updated 2 years ago
- Official code repository for Prompt-DT.☆98Updated 2 years ago
- Code repository for the paper "LLM-Coordination: Evaluating and Analyzing Multi-agent Coordination Abilities in Large Language Models"☆23Updated last month
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 6 months ago
- ☆15Updated 9 months ago