floringogianu / categorical-dqnView external linksLinks
A working implementation of the Categorical DQN (Distributional RL).
☆95Apr 7, 2018Updated 7 years ago
Alternatives and similar repositories for categorical-dqn
Users that are interested in categorical-dqn are comparing it to the libraries listed below
Sorting:
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- [adversarial] examples and training cost☆19Jun 29, 2016Updated 9 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre…☆133May 5, 2019Updated 6 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning☆81Nov 22, 2017Updated 8 years ago
- ☆15Sep 5, 2016Updated 9 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 7 years ago
- Reinforcement learning models in ViZDoom environment☆130Mar 9, 2022Updated 3 years ago
- Deeper DCGAN with AE stabilization☆38Mar 20, 2024Updated last year
- PyTorch bindings for openai-gemm☆20Feb 6, 2017Updated 9 years ago
- ☆58Aug 28, 2018Updated 7 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN"☆42Apr 10, 2016Updated 9 years ago
- ☆53Mar 23, 2017Updated 8 years ago
- Solving The Malmo Collaborative AI Challenge☆59Jul 23, 2017Updated 8 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019)☆17Dec 8, 2022Updated 3 years ago
- Playground for reinforcement learning algorithms implemented in TensorFlow☆16Oct 18, 2016Updated 9 years ago
- ☆45Apr 25, 2017Updated 8 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆313Feb 16, 2021Updated 5 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization☆37Mar 8, 2018Updated 7 years ago
- ☆21May 24, 2016Updated 9 years ago
- E2C implementation in PyTorch☆43Jul 5, 2017Updated 8 years ago
- Decoupled Neural Interfaces using Synthetic Gradients for PyTorch☆239Jan 12, 2019Updated 7 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN)☆57Aug 25, 2017Updated 8 years ago
- Implement A3C for Mujoco gym envs☆73Nov 2, 2017Updated 8 years ago
- ☆38Mar 6, 2017Updated 8 years ago
- Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom☆278Feb 20, 2018Updated 7 years ago
- This is my implementation of the Optimality Tightening☆37Apr 26, 2017Updated 8 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.☆225Mar 29, 2017Updated 8 years ago
- tensorflow deep RL hacking on minecraft with malmo☆54Jan 17, 2017Updated 9 years ago
- Pytorch implementation of "Forward Thinking: Building and Training Neural Networks One Layer at a Time"☆65Jun 14, 2017Updated 8 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft"☆86Dec 23, 2016Updated 9 years ago
- Malmo Collaborative AI Challenge - Team Pig Catcher☆66May 22, 2017Updated 8 years ago
- ☆33May 17, 2016Updated 9 years ago
- Solving reinforcement learning tasks which require language and vision☆33Apr 4, 2023Updated 2 years ago
- Reinforcement learning in 3D.☆21Mar 29, 2017Updated 8 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Robust policy search algorithms which train on model ensembles☆30Oct 26, 2016Updated 9 years ago
- Training Sonic with RLlib☆60Apr 2, 2023Updated 2 years ago
- Input Convex Neural Networks☆311Mar 20, 2019Updated 6 years ago