ahjwang / messenger-emma
Implements the Messenger environment and EMMA model.
☆23Updated last year
Alternatives and similar repositories for messenger-emma
Users that are interested in messenger-emma are comparing it to the libraries listed below
Sorting:
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 10 months ago
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction.☆21Updated 4 years ago
- Sandbox environment for generalizable agent research☆25Updated 2 years ago
- ☆10Updated 2 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight)☆37Updated 4 years ago
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling☆16Updated 3 years ago
- ☆20Updated 2 years ago
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline"☆33Updated last year
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Updated 3 years ago
- Phy-Q: A Testbed for Physical Reasoning☆44Updated 9 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Updated 9 months ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Updated 2 years ago
- ☆41Updated 3 years ago
- Change-Based Exploration Transfer☆36Updated 3 years ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer☆27Updated last year
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆104Updated 2 years ago
- Reinforcement Learning via Supervised Learning☆71Updated 2 years ago
- Exploring techniques to generate diverse conventions in multi-agent settings☆14Updated last year
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆56Updated 7 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 5 years ago
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021)☆54Updated 3 years ago
- ☆15Updated 3 years ago
- Representation Learning in RL☆16Updated 2 years ago
- Code for "World Model as a Graph: Learning Latent Landmarks for Planning" (ICML 2021 Long Presentation)☆66Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆28Updated 3 years ago
- EARL: Environment for Autonomous Reinforcement Learning☆37Updated 2 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- ☆31Updated 2 years ago
- Implementation of the Off Belief Learning algorithm.☆47Updated 2 years ago