msclar / symmtom
Code for the paper "Symmetric Machine Theory of Mind", presented at ICML 2022.
☆12Updated 2 years ago
Related projects: ⓘ
- Implements the Messenger environment and EMMA model.☆22Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆100Updated 2 years ago
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games☆13Updated 2 years ago
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments)☆12Updated 3 years ago
- ☆11Updated 5 months ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆17Updated 3 years ago
- ☆65Updated 2 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆54Updated 2 months ago
- ☆23Updated last year
- Github repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options☆42Updated 2 years ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆24Updated last year
- Official code repository for Prompt-DT.☆93Updated 2 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆62Updated 3 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆49Updated 8 months ago
- ☆44Updated last year
- Object Centric Atari games☆43Updated this week
- ☆11Updated last year
- A reinforcement learning environment for the IGLU 2022 at NeurIPS☆32Updated last year
- Pre-Trained Language Models for Interactive Decision-Making [NeurIPS 2022]☆116Updated 2 years ago
- A PyTorch Implementation of Skipper☆20Updated 7 months ago
- Implementation of TWOSOME☆42Updated 4 months ago
- Rewarded soups official implementation☆43Updated 11 months ago
- Knowledge-Aware RL agents with Commonsense Reasoning☆75Updated 2 years ago
- Code and data for "Inferring Rewards from Language in Context" [ACL 2022].☆15Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients☆24Updated last week
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according …☆35Updated 4 months ago
- Code for the paper: Dense Reward for Free in Reinforcement Learning from Human Feedback (ICML 2024) by Alex J. Chan, Hao Sun, Samuel Holt…☆17Updated last month
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆43Updated 6 months ago
- ☆19Updated 2 years ago
- ☆19Updated 2 years ago