ahjwang / messenger-emma
Implements the Messenger environment and EMMA model.
☆23 Updated last year
Alternatives and similar repositories for messenger-emma:
Users who are interested in messenger-emma are comparing it to the repositories listed below.
- Reward shaping approach for instruction following settings, leveraging language at multiple levels of abstraction. ☆19 Updated 3 years ago
- ☆20 Updated 2 years ago
- Phy-Q: A Testbed for Physical Reasoning ☆43 Updated 7 months ago
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems" ☆54 Updated 8 months ago
- Change-Based Exploration Transfer ☆36 Updated 2 years ago
- ☆11 Updated 2 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning" ☆20 Updated 3 years ago
- ☆22 Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools ☆18 Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 Updated 5 years ago
- Sandbox environment for generalizable agent research ☆24 Updated 2 years ago
- ☆16 Updated 3 years ago
- Code for "Task-Agnostic Continual RL: In Praise of a Simple Baseline" ☆33 Updated last year
- Official code for "Pretraining Representations For Data-Efficient Reinforcement Learning" (NeurIPS 2021) ☆52 Updated 3 years ago
- Object Centric Atari games ☆68 Updated this week
- ☆40 Updated 3 years ago
- A System for Morphology-Task Generalization via Unified Representation and Behavior Distillation (ICLR2023) ☆13 Updated 2 years ago
- Implementation of ICML 2023 paper: Future-conditioned Unsupervised Pretraining for Decision Transformer ☆27 Updated last year
- Author's PyTorch implementation of SR-DICE for marginalized importance sampling ☆15 Updated 3 years ago
- PyTorch Implementation of "Language as an Abstraction for Hierarchical Deep Reinforcement Learning" paper ☆24 Updated 3 years ago
- Efficient seed-parallel implementation of "Breaking the Replay Ratio Barrier" ☆22 Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax). ☆17 Updated 2 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆85 Updated last year
- Docker containers of baseline agents for the Crafter environment ☆28 Updated 3 years ago
- Mutual Information State Intrinsic Control (ICLR 2021 Spotlight) ☆37 Updated 4 years ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning) ☆54 Updated 4 months ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning ☆16 Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization ☆31 Updated 3 years ago
- Novelty MiniGrid--NovGrid--is an extension of MiniGrid environment that allows for the world properties and dynamics to change according … ☆35 Updated 9 months ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024 ☆28 Updated 6 months ago