qlan3 / Explorer
Explorer is a PyTorch reinforcement learning framework for exploring new ideas.
☆95Updated last month
Alternatives and similar repositories for Explorer
Users who are interested in Explorer are comparing it to the libraries listed below.
- ☆132Updated last year
- Code for MOPO: Model-based Offline Policy Optimization☆182Updated 3 years ago
- A list of papers regarding generalization in (deep) reinforcement learning☆152Updated last year
- Code for "Data-Efficient Reinforcement Learning with Self-Predictive Representations"☆161Updated 3 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020)☆103Updated 3 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆139Updated 2 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆125Updated 4 years ago
- ☆75Updated last year
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆192Updated 2 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆69Updated last year
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆161Updated 5 years ago
- ☆54Updated last year
- ☆61Updated 7 years ago
- This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …☆88Updated 4 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆149Updated 3 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE☆54Updated 4 months ago
- Simplified Action Decoder for Deep Multi-Agent Reinforcement Learning☆102Updated 3 years ago
- Implementation for the paper "Exploration via Hierarchical Meta Reinforcement Learning"☆60Updated 6 years ago
- A simple PyTorch implementation of discrete soft Q-learning (SQL) and soft Q imitation learning (SQIL)☆56Updated 2 years ago
- Gridworld for MARL experiments☆141Updated 4 years ago
- Combining Evolutionary Algorithms and deep RL in various ways☆103Updated 4 years ago
- ☆199Updated 2 years ago
- Decoupled Reward-free ExplorAtion and Execution for Meta-reinforcement learning☆90Updated 2 years ago
- Code accompanying the paper "DOP: Off-Policy Multi-Agent Decomposed Policy Gradients" (ICLR 2021, https://arxiv.org/abs/2007.12322)☆51Updated 2 years ago
- ☆113Updated 2 years ago
- PyTorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆96Updated 3 years ago
- ☆86Updated last year
- ☆53Updated 5 years ago
- Implementation of the Option-Critic Architecture☆40Updated 6 years ago
- Code accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)☆81Updated 2 years ago