Silvicek / distributional-dqnView external linksLinks
Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regression' based on OpenAi DQN baselines.
☆133May 5, 2019Updated 6 years ago
Alternatives and similar repositories for distributional-dqn
Users that are interested in distributional-dqn are comparing it to the libraries listed below
Sorting:
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 7 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆97Sep 3, 2020Updated 5 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)☆11Sep 14, 2017Updated 8 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆435Nov 28, 2023Updated 2 years ago
- Reinforcement learning algorithm implementation☆10Oct 31, 2021Updated 4 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Aug 25, 2017Updated 8 years ago
- C51-DDQN in Keras☆127Nov 8, 2017Updated 8 years ago
- Keras implementation of guide actor-critic for continuous control☆11Mar 12, 2018Updated 7 years ago
- 🐳 Implementation of various Distributional Reinforcement Learning Algorithms using TensorFlow2.☆70Feb 28, 2021Updated 4 years ago
- ROS package for robot learning☆17Oct 16, 2019Updated 6 years ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Oct 8, 2016Updated 9 years ago
- TensorFlow implementation of Deep RL (Reinforcement Learning) papers based on deep Q-learning (DQN)☆10Mar 1, 2018Updated 7 years ago
- Surprise-based intrinsic motivation for deep reinforcement learning☆21Mar 6, 2017Updated 8 years ago
- Actor-critic with experience replay☆256Oct 9, 2022Updated 3 years ago
- ☆160Jul 21, 2017Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆361Jun 2, 2020Updated 5 years ago
- reinforcement learning. policy gradient. PCL☆37Apr 25, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- on-policy optimization baselines for deep reinforcement learning☆32Apr 3, 2020Updated 5 years ago
- Code for Diagnosing Bottlenecks in Deep Q-learning. Contains implementations of tabular environments plus solvers.☆19May 14, 2019Updated 6 years ago
- These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implem…☆17Sep 20, 2017Updated 8 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Jul 17, 2020Updated 5 years ago
- Publicly releasable baselines for the Retro contest☆129Nov 22, 2018Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161☆186Nov 1, 2017Updated 8 years ago
- Implementation of clipped action policy gradient (CAPG) with PPO and TRPO☆31Jun 24, 2018Updated 7 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆51Jan 16, 2019Updated 7 years ago
- ☆101Aug 15, 2016Updated 9 years ago
- Collaborative Deep Reinforcement Learning☆31Jul 29, 2017Updated 8 years ago
- ☆13May 15, 2025Updated 9 months ago
- Rainbow: Combining Improvements in Deep Reinforcement Learning☆1,660Jan 13, 2022Updated 4 years ago
- Review notes for EE127/227A, based on the Spring 2017 iteration of the course.☆22Dec 4, 2017Updated 8 years ago
- Tutorials on learning and using successor representations.☆54Oct 31, 2019Updated 6 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆37Oct 14, 2020Updated 5 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆530Nov 22, 2022Updated 3 years ago