kzl / lifelong_rl
Pytorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Reset-Free Lifelong Learning with Skill-Space Planning.
☆98Updated 2 years ago
Related projects ⓘ
Alternatives and complementary repositories for lifelong_rl
- ExORL: Exploratory Data for Offline Reinforcement Learning☆104Updated 2 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆156Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆83Updated 3 years ago
- ☆110Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆171Updated 2 years ago
- ☆53Updated 8 months ago
- CaDM: Context-aware Dynamics Model for Generalization in Model-based Reinforcement Learning☆63Updated 4 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆162Updated 2 years ago
- ☆44Updated last year
- Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective☆79Updated last year
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆145Updated 3 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆98Updated 2 years ago
- ☆52Updated 4 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆82Updated 2 years ago
- Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …☆69Updated last year
- The MAGICAL benchmark suite for robust imitation learning (NeurIPS 2020)☆75Updated 11 months ago
- Deep Hierarchical Planning from Pixels☆90Updated last year
- Official Codebase for Offline Reinforcement Learning from Images with Latent Space Models☆28Updated 3 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆61Updated 4 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- PyTorch implementation of Stochastic Latent Actor-Critic(SLAC).☆87Updated 3 months ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Simple maze environments using mujoco-py☆52Updated 10 months ago
- Code for "Learning to Reach Goals via Iterated Supervised Learning"☆76Updated 2 years ago
- ☆41Updated 3 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆204Updated 5 months ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration☆65Updated 3 years ago
- Conservative Q learning in Jax☆50Updated last year
- Change-Based Exploration Transfer☆36Updated 2 years ago