aravindsrinivas / curl_rainbow
☆53Updated 4 years ago
Alternatives and similar repositories for curl_rainbow:
Users that are interested in curl_rainbow are comparing it to the libraries listed below
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆147Updated 3 years ago
- Pytorch implementation on OpenAI's Procgen ppo-baseline, built from scratch.☆30Updated 4 years ago
- ☆53Updated last year
- ☆194Updated 2 years ago
- ☆112Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆160Updated 3 years ago
- pytorch-implementation of Dreamer (Model-based Image RL Algorithm)☆166Updated 2 months ago
- ☆55Updated 2 years ago
- ☆26Updated 2 years ago
- Soft Actor-Critic☆144Updated 7 years ago
- Simple maze environments using mujoco-py☆54Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆177Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆214Updated 10 months ago
- A Tensorflow implementation of the Option-Critic Architecture☆72Updated 7 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆124Updated 4 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆70Updated last year
- ☆66Updated 4 years ago
- Code for "Offline Meta-Reinforcement Learning with Advantage Weighting" [ICML 2021]☆47Updated 2 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆92Updated 3 weeks ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆167Updated 3 years ago
- Random parameter environments using gym 0.7.4 and mujoco-py 0.5.7☆20Updated 6 years ago
- discrete soft Q learning(SQL) and soft Q imitation learning(SQIL) implementation in pytorch, simple!☆56Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆100Updated 2 years ago
- Implementation of the Option-Critic Architecture☆39Updated 6 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆28Updated 5 years ago
- Code for "Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills"☆37Updated 5 years ago
- PyTorch IMPALA implementation☆26Updated 5 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆22Updated 5 years ago
- Conservative Q Learning on top of SAC☆130Updated 2 years ago
- ☆60Updated 6 years ago