go2sea / C51DQNView external linksLinks
A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)
☆57Aug 25, 2017Updated 8 years ago
Alternatives and similar repositories for C51DQN
Users that are interested in C51DQN are comparing it to the libraries listed below
Sorting:
- Implementation of A Distributional Perspective on Reinforcement Learning☆35Aug 1, 2017Updated 8 years ago
- Tensorflow implementation for "Noisy network for exploration"☆19Aug 2, 2017Updated 8 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆81Nov 22, 2017Updated 8 years ago
- C51-DDQN in Keras☆127Nov 8, 2017Updated 8 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago
- Reinforcement learning in 3D.☆21Mar 29, 2017Updated 8 years ago
- Collaborative Deep Reinforcement Learning☆31Jul 29, 2017Updated 8 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆80Jan 5, 2019Updated 7 years ago
- Unsupervised Perceptual Rewards for Imitation Learning☆11Feb 3, 2018Updated 8 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- Modified tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆21Dec 15, 2016Updated 9 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Tensorflow implementation of 'Asynchronous Methods for Deep Reinforcement Learning'☆13Dec 23, 2016Updated 9 years ago
- RC-NFQ: Regularized Convolutional Neural Fitted Q Iteration. A batch algorithm for deep reinforcement learning. Incorporates dropout regu…☆12Mar 17, 2021Updated 4 years ago
- A Pygame+Pymunk Carrom Simulation Testbed for reinforcement learning. [CS747][ Foundations of Intelligent and Learning Agents]☆15Jun 24, 2019Updated 6 years ago
- A short and easy implementation of Quantile Regression DQN | Distributional Reinforcement Learning☆97Sep 3, 2020Updated 5 years ago
- Implementation of Meta-RL A3C algorithm☆407Feb 22, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Accompanying code for "Deep Reinforcement Learning that Matters"☆155Sep 22, 2017Updated 8 years ago
- A Tutorial on Modeling and Inference in Undirected Graphical Models for Hyperspectral Image Analysis☆21May 8, 2018Updated 7 years ago
- Deep Attention Recurrent Q-Network☆115Nov 7, 2015Updated 10 years ago
- Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow - Tensorlfow Im…☆13Feb 2, 2019Updated 7 years ago
- ☆24Dec 22, 2016Updated 9 years ago
- (Keras) Use deep Q-learning to build two Gomoku (Five-in-a-Row) agents playing against each other.☆19Oct 8, 2016Updated 9 years ago
- Sublinear memory optimization for deep learning, reduce GPU memory cost to train deeper nets☆28Apr 22, 2016Updated 9 years ago
- Towards Visual Explanations for Convolutional Neural Networks via Input Resampling☆13Aug 16, 2017Updated 8 years ago
- Python wrapper for MuJoCo physics simulation.☆12Feb 14, 2019Updated 7 years ago
- An implementation of FeUdal Networks for Hierarchical Reinforcement Learning as published : https://arxiv.org/abs/1703.01161☆186Nov 1, 2017Updated 8 years ago
- Code base for solving Markov Decision Processes and Reinforcement Learning problems using Recurrent Convolutional Neural Networks.☆69Oct 29, 2017Updated 8 years ago
- An aspiring attempt to generate a continuous space of sentences with DenseNet☆26May 4, 2017Updated 8 years ago
- ☆98Aug 25, 2016Updated 9 years ago
- Implementations of deep RL papers and random experimentation☆178Apr 7, 2018Updated 7 years ago
- Gym - Doom environments based on VizDoom.☆104Mar 17, 2017Updated 8 years ago
- Stochastic Neural Networks for Hierarchical Reinforcement Learning☆94Apr 17, 2018Updated 7 years ago
- Implement A3C for Mujoco gym envs☆73Nov 2, 2017Updated 8 years ago
- [NIPS 2017] InfoGAIL: Interpretable Imitation Learning from Visual Demonstrations☆184Nov 14, 2024Updated last year
- Tensorflow implementation of "The Predictron: End-To-End Learning and Planning"☆291Jan 20, 2017Updated 9 years ago
- Tensor Switching Networks☆12Nov 2, 2017Updated 8 years ago