facebookresearch / controllable_agentLinks
The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.
☆64Updated last year
Alternatives and similar repositories for controllable_agent
Users that are interested in controllable_agent are comparing it to the libraries listed below
Sorting:
- ☆46Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆114Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆81Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"☆57Updated last year
- Evaluating long-term memory of reinforcement learning algorithms☆143Updated 2 years ago
- Reinforcement Learning via Supervised Learning☆71Updated 3 years ago
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆78Updated 3 years ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021☆85Updated last year
- ☆43Updated 4 years ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆70Updated last year
- Deep Hierarchical Planning from Pixels☆103Updated 2 years ago
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆80Updated 2 years ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions"☆89Updated last year
- Codebase for "Uni[MASK]: Unified Inference in Sequential Decision Problems"☆55Updated 11 months ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆119Updated 9 months ago
- Reproduction of Dreamerv1 and v2 in pytorch for deepmind control suite☆39Updated 2 years ago
- ☆40Updated 3 years ago
- Code for "Masked Autoencoding for Scalable and Generalizable Decision Making". NeurIPS 2022☆44Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆103Updated last year
- My Body Is A Cage☆41Updated 4 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆82Updated 3 years ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022)☆67Updated 2 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆56Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 3 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL☆113Updated 10 months ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu☆105Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations☆82Updated last year
- ☆70Updated 3 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023)☆32Updated 2 years ago
- Change-Based Exploration Transfer☆35Updated 3 years ago