pokaxpoka / sunrise
SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning
☆119 Updated 3 years ago
Related projects
Alternatives and complementary repositories for sunrise
- Learning Invariant Representations for Reinforcement Learning without Reconstruction ☆145 Updated 3 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆171 Updated 2 years ago
- ☆188 Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆154 Updated 2 years ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆147 Updated last year
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction ☆158 Updated 4 years ago
- ☆117 Updated 3 months ago
- ☆106 Updated 4 years ago
- ☆110 Updated last year
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas. ☆88 Updated 2 months ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019) ☆68 Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆156 Updated 2 years ago
- Multi Task RL Baselines ☆223 Updated 2 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆53 Updated 5 years ago
- ☆59 Updated 6 years ago
- Conservative Q Learning on top of SAC ☆119 Updated 2 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆98 Updated 2 years ago
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm) ☆162 Updated 2 years ago
- ☆85 Updated 3 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO ☆191 Updated 2 years ago
- PyTorch implementations of RL algorithms, focusing on model-based, lifelong, reset-free, and offline algorithms. Official codebase for Re… ☆98 Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆150 Updated 2 weeks ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆186 Updated last year
- PyTorch implementation of Stochastic Latent Actor-Critic (SLAC). ☆87 Updated 3 months ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments ☆143 Updated 3 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆83 Updated 3 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆71 Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite ☆204 Updated 5 months ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020) ☆181 Updated last year
- Code for the paper "Meta-Q-Learning" (ICLR 2020) ☆102 Updated 2 years ago