txzhao / rl-zooLinks
PyTorch implementation of various reinforcement learning algorithms
☆18Updated 7 years ago
Alternatives and similar repositories for rl-zoo
Users that are interested in rl-zoo are comparing it to the libraries listed below
Sorting:
- Bayesian Uncertainty Exploration in Deep Reinforcement Learning☆18Updated 7 years ago
- Code for training and testing a Hidden Parameter Markov Decision Process, used to facilitate the transfer of learning☆32Updated 7 years ago
- Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm☆44Updated 6 years ago
- Meta Reinforcement Learning Experiments☆34Updated 7 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.☆56Updated 6 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 7 years ago
- State Space Models for Reinforcement Learning in Tensorflow☆19Updated 6 years ago
- hierarchical Q-learning implementation☆11Updated 10 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 7 years ago
- hierarchical deep reinforcement learning algorithms☆41Updated 7 years ago
- Actor Critic using Kronecker-Factored Trust Region☆19Updated 6 years ago
- Contextual Bandits Action Elimination DQN☆21Updated 7 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Updated 2 years ago
- Exploration Strategies for Deep Reinforcement Learning☆39Updated 6 years ago
- Maximum Causal Entropy Inverse Reinforcement Learning☆47Updated 6 years ago
- Implementation of Deepmind's Neural Episodic Control☆58Updated 7 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆31Updated 6 years ago
- ☆43Updated 8 years ago
- PyTorch implementation of CommNet☆36Updated 7 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 6 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆50Updated 6 years ago
- (Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards☆28Updated 6 years ago
- ICRL 2020☆19Updated 5 years ago
- ☆92Updated last year
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆22Updated 6 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆20Updated 8 years ago
- Robust policy search algorithms which train on model ensembles☆28Updated 8 years ago
- Distributed implementation of popular evolutionary methods☆64Updated 7 years ago
- TD-Regularized Actor-Critic Methods☆36Updated 5 years ago