yhyu13 / C51-DDPG
This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)
☆10Updated 7 years ago
Alternatives and similar repositories for C51-DDPG:
Users that are interested in C51-DDPG are comparing it to the libraries listed below
- PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.☆15Updated 7 years ago
- A comparison of parameter space noise methods for exploration in deep reinforcement learning☆27Updated 5 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆93Updated 2 years ago
- Meta Reinforcement Learning Experiments☆33Updated 7 years ago
- Reinforcement Learning papers on exploration methods.☆20Updated 3 years ago
- ☆81Updated 3 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆66Updated 6 years ago
- High-quality implementations of deep reinforcement learning algorithms for experiments☆51Updated 5 months ago
- ☆35Updated 6 years ago
- ☆44Updated 6 years ago
- Proximal Policy Optimization with Stein Control Variates:☆33Updated 6 years ago
- The state-of-art deep rl algorithms for Montezuma's revenge☆25Updated 6 years ago
- PyTorch implementation of Proximal Policy Optimization☆50Updated 7 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆151Updated 7 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆71Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Updated 4 years ago
- ☆41Updated 6 years ago
- Reinforcement Learning and Deep Learning Resources☆16Updated 6 years ago
- Random Network Distillation(RND) algo in Pytorch☆48Updated 5 years ago
- PyTorch IMPALA implementation☆25Updated 5 years ago
- Implementation of Deepmind's Neural Episodic Control☆58Updated 6 years ago
- Energy-Based Hindsight Experience Prioritization (CoRL 2018) Oral presentation (7%)☆33Updated 6 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆94Updated 4 years ago
- Guided-Meta Policy Search☆41Updated 2 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆31Updated 3 years ago
- Exploration based Reinforcement Learning. (Montezuma Revenge)☆14Updated 6 years ago
- Self-implemented code for Model-Based Meta-Reinforcement Learning☆17Updated 5 years ago
- ☆43Updated 7 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆80Updated 7 years ago
- Efficient Exploration through Bayesian Deep Q-Networks☆37Updated 6 years ago