msclar / symmtom
Code for the paper "Symmetric Machine Theory of Mind", presented at ICML 2022.
☆12 Updated 2 years ago
Alternatives and similar repositories for symmtom:
Users who are interested in symmtom also compare it to the repositories listed below.
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆103 Updated 2 years ago
- ☆24 Updated 2 years ago
- Implements the Messenger environment and EMMA model. ☆23 Updated last year
- [ICLR 2022 Spotlight] Multi-Stage Episodic Control for Strategic Exploration in Text Games ☆14 Updated 2 years ago
- ☆13 Updated 9 months ago
- Official code repository for Prompt-DT. ☆101 Updated 2 years ago
- The Implementation of "Machine Theory of Mind", ICML 2018 ☆22 Updated 2 years ago
- A PyTorch Implementation of Skipper ☆20 Updated 3 months ago
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation ☆25 Updated last year
- GitHub repo for HIDIO: Hierarchical Reinforcement Learning by Discovering Intrinsic Options ☆44 Updated 3 years ago
- ☆28 Updated 2 years ago
- ☆75 Updated 6 months ago
- Exploring techniques to generate diverse conventions in multi-agent settings ☆12 Updated last year
- Implementation of the Off Belief Learning algorithm. ☆46 Updated 2 years ago
- Advantage Leftover Lunch Reinforcement Learning (A-LoL RL): Improving Language Models with Advantage-based Offline Policy Gradients ☆26 Updated 4 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆54 Updated 6 months ago
- A reinforcement learning environment for the IGLU 2022 at NeurIPS ☆33 Updated last year
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆106 Updated 2 years ago
- Overcooked-AI Experiment Psiturk Demo (for MTurk experiments) ☆12 Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 7 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆52 Updated 3 months ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022 ☆44 Updated 10 months ago
- Code for Sibling Rivalry and experiments presented in associated paper ☆17 Updated 3 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021) ☆52 Updated 3 years ago
- ☆11 Updated 2 years ago
- RAD: Reinforcement Learning with Augmented Data (code for procgen experiments) ☆18 Updated 3 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆54 Updated 9 months ago
- Implementation of TWOSOME ☆60 Updated last week
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline" ☆32 Updated last year
- The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning… ☆61 Updated last year