A working implementation of the Categorical DQN (Distributional RL).
☆95 · Apr 7, 2018 · Updated 8 years ago
Alternatives and similar repositories for categorical-dqn
Users that are interested in categorical-dqn are comparing it to the libraries listed below.
- [adversarial] examples and training cost ☆19 · Jun 29, 2016 · Updated 9 years ago
- Solving The Malmo Collaborative AI Challenge ☆59 · Jul 23, 2017 · Updated 8 years ago
- Noisy Networks for Exploration ☆187 · Jan 28, 2018 · Updated 8 years ago
- Implementation of 'A Distributional Perspective on Reinforcement Learning' and 'Distributional Reinforcement Learning with Quantile Regre… ☆134 · May 5, 2019 · Updated 7 years ago
- Distributed A3C ☆34 · Dec 22, 2017 · Updated 8 years ago
- ☆58 · Aug 28, 2018 · Updated 7 years ago
- Torch implementation of "Deep Exploration via Bootstrapped DQN" ☆42 · Apr 10, 2016 · Updated 10 years ago
- Code to reproduce Supervised Policy Update (ICLR 2019) ☆17 · Dec 8, 2022 · Updated 3 years ago
- ☆15 · Sep 5, 2016 · Updated 9 years ago
- ☆38 · Mar 6, 2017 · Updated 9 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286 ☆184 · Mar 25, 2018 · Updated 8 years ago
- Deeper DCGAN with AE stabilization ☆38 · Mar 20, 2024 · Updated 2 years ago
- Reinforcement learning models in ViZDoom environment ☆130 · Mar 9, 2022 · Updated 4 years ago
- Playground for reinforcement learning algorithms implemented in TensorFlow ☆16 · Oct 18, 2016 · Updated 9 years ago
- PyTorch bindings for openai-gemm ☆20 · Feb 6, 2017 · Updated 9 years ago
- Feature Control as Intrinsic Motivation for Hierarchical Reinforcement Learning ☆81 · Nov 22, 2017 · Updated 8 years ago
- Implementation of "Control of Memory, Active Perception, and Action in Minecraft" ☆86 · Dec 23, 2016 · Updated 9 years ago
- A TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DQN) ☆57 · Aug 25, 2017 · Updated 8 years ago
- Local experiment manager ☆14 · Jan 16, 2026 · Updated 3 months ago
- Malmo Collaborative AI Challenge - Team Pig Catcher ☆64 · May 22, 2017 · Updated 8 years ago
- Implementation of algorithms for continuous control (DDPG and NAF). ☆313 · Feb 16, 2021 · Updated 5 years ago
- ☆53 · Mar 23, 2017 · Updated 9 years ago
- C51-DDQN in Keras ☆127 · Nov 8, 2017 · Updated 8 years ago
- E2C implementation in PyTorch ☆43 · Jul 5, 2017 · Updated 8 years ago
- Implement A3C for Mujoco gym envs ☆73 · Nov 2, 2017 · Updated 8 years ago
- ☆21 · May 24, 2016 · Updated 9 years ago
- Record demonstrations for µniverse ☆21 · Jan 3, 2019 · Updated 7 years ago
- Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom ☆278 · Feb 20, 2018 · Updated 8 years ago
- ☆45 · Apr 25, 2017 · Updated 9 years ago
- ☆18 · Apr 25, 2016 · Updated 10 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom. ☆224 · Mar 29, 2017 · Updated 9 years ago
- Understanding Short-Horizon Bias in Stochastic Meta-Optimization ☆37 · Mar 8, 2018 · Updated 8 years ago
- Ranking Policy Gradient ☆23 · Nov 27, 2019 · Updated 6 years ago
- Decoupled Neural Interfaces using Synthetic Gradients for PyTorch ☆237 · Jan 12, 2019 · Updated 7 years ago
- ☆160 · Jul 21, 2017 · Updated 8 years ago
- Solving reinforcement learning tasks which require language and vision ☆33 · Apr 4, 2023 · Updated 3 years ago
- ML/DL/RL paper notes ☆21 · Dec 19, 2018 · Updated 7 years ago
- ☆137 · Oct 23, 2017 · Updated 8 years ago
- Persistent advantage learning dueling double DQN for the Arcade Learning Environment ☆263 · Feb 8, 2018 · Updated 8 years ago