facebookresearch / controllable_agent
The Controllable Agent project trains RL agents that can optimize any reward function specified in real time, without any further learning or fine-tuning. Training is reward-free and based on the Forward-Backward representation.
☆64 Updated last year
Alternatives and similar repositories for controllable_agent
Users who are interested in controllable_agent are comparing it to the libraries listed below.
- ☆46 Updated 2 years ago
- Deep Hierarchical Planning from Pixels ☆104 Updated 2 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆115 Updated 3 years ago
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery ☆81 Updated 2 years ago
- Code for "Unsupervised Zero-Shot RL via Functional Reward Representations" ☆57 Updated last year
- Reinforcement Learning via Supervised Learning ☆71 Updated 3 years ago
- Public code for "Reinforcement Learning from Passive Data via Latent Intentions" ☆89 Updated last year
- Fast reinforcement learning research ☆61 Updated 7 months ago
- Discovering and Achieving Goals via World Models, NeurIPS 2021 ☆85 Updated last year
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024) ☆70 Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective ☆80 Updated 2 years ago
- Skeleton for scalable and flexible Jax RL implementations ☆83 Updated 2 years ago
- Code for "Learning to Reach Goals via Iterated Supervised Learning" ☆78 Updated 3 years ago
- My Body Is A Cage ☆41 Updated 4 years ago
- Predictable MDP Abstraction for Unsupervised Model-Based RL (ICML 2023) ☆32 Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆82 Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms ☆145 Updated 2 years ago
- ☆31 Updated 2 years ago
- EARL: Environment for Autonomous Reinforcement Learning ☆37 Updated 2 years ago
- Conservative Q learning in Jax ☆54 Updated 2 years ago
- CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL ☆114 Updated 10 months ago
- Foundation Policies with Hilbert Representations (ICML 2024) ☆89 Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations ☆103 Updated last year
- Sandbox environment for generalizable agent research ☆25 Updated 2 years ago
- ☆31 Updated last year
- Extreme Q-Learning: Max Entropy RL without Entropy ☆87 Updated 2 years ago
- Official code for "Can Wikipedia Help Offline Reinforcement Learning?" by Machel Reid, Yutaro Yamada and Shixiang Shane Gu ☆105 Updated 2 years ago
- General Modules for JAX ☆66 Updated 3 months ago
- Generalized Decision Transformer for Offline Hindsight Information Matching (ICLR2022) ☆67 Updated 2 years ago
- a simple and scalable agent for training adaptive policies with sequence-based RL ☆131 Updated last week