qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆92 Updated last month
Alternatives and similar repositories for Explorer
Users who are interested in Explorer are comparing it to the libraries listed below.
- Code for MOPO: Model-based Offline Policy Optimization ☆179 Updated 3 years ago
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations" ☆161 Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction ☆138 Updated 2 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning ☆125 Updated 4 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020) ☆103 Updated 2 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019) ☆70 Updated last year
- ☆61 Updated 6 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning" ☆100 Updated 3 years ago
- ☆130 Updated 10 months ago
- ☆53 Updated last year
- Learning Invariant Representations for Reinforcement Learning without Reconstruction ☆147 Updated 3 years ago
- A list of papers regarding generalization in (deep) reinforcement learning ☆152 Updated last year
- ☆196 Updated 2 years ago
- Combining Evolutionary Algorithms and deep RL in various ways ☆102 Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆55 Updated 2 months ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction ☆160 Updated 4 years ago
- Model-Based Offline Reinforcement Learning ☆50 Updated 4 years ago
- ☆112 Updated 2 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the … ☆86 Updated 3 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning ☆92 Updated 2 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆168 Updated 3 years ago
- Random network distillation on Montezuma's Revenge and Super Mario Bros. ☆49 Updated 3 weeks ago
- RE3: State Entropy Maximization with Random Encoders for Efficient Exploration ☆68 Updated 3 years ago
- A Tensorflow implementation of the Option-Critic Architecture ☆72 Updated 8 years ago
- ☆53 Updated 4 years ago
- PyTorch implementation of Dreamer (Model-based Image RL Algorithm) ☆166 Updated 4 months ago
- discrete soft Q learning (SQL) and soft Q imitation learning (SQIL) implementation in PyTorch, simple! ☆56 Updated 2 years ago
- ☆91 Updated last year
- Soft Actor-Critic ☆147 Updated 7 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆128 Updated 10 months ago