dgriff777 / rl_a3c_pytorchView external linksLinks
A3C LSTM Atari with Pytorch plus A3G design
☆570Apr 18, 2023Updated 2 years ago
Alternatives and similar repositories for rl_a3c_pytorch
Users that are interested in rl_a3c_pytorch are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,314Sep 25, 2019Updated 6 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆260Oct 11, 2024Updated last year
- Deep Reinforcement Learning with pytorch & visdom☆804Jul 16, 2020Updated 5 years ago
- PyTorch implementation of Advantage Actor Critic (A2C), Proximal Policy Optimization (PPO), Scalable trust-region method for deep reinfor…☆3,875May 29, 2022Updated 3 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆661Feb 25, 2020Updated 5 years ago
- Reinforcement learning models in ViZDoom environment☆130Mar 9, 2022Updated 3 years ago
- A working implementation of the Categorical DQN (Distributional RL).☆95Apr 7, 2018Updated 7 years ago
- Actor-critic with experience replay☆256Oct 9, 2022Updated 3 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆313Feb 16, 2021Updated 4 years ago
- Modularized Implementation of Deep RL Algorithms in PyTorch☆3,406Apr 16, 2024Updated last year
- Rainbow: Combining Improvements in Deep Reinforcement Learning☆1,660Jan 13, 2022Updated 4 years ago
- A high-performance Atari A3C agent in 180 lines of PyTorch☆173Jul 31, 2021Updated 4 years ago
- [ICML 2017] TensorFlow code for Curiosity-driven Exploration for Deep Reinforcement Learning☆1,471Dec 7, 2022Updated 3 years ago
- Neural Turing Machine (NTM) & Differentiable Neural Computer (DNC) with pytorch & visdom☆278Feb 20, 2018Updated 7 years ago
- PyTorch implementation of Advantage async actor-critic Algorithms (A3C) in PyTorch☆114Apr 3, 2017Updated 8 years ago
- Asynchronous Methods for Deep Reinforcement Learning☆591Aug 9, 2018Updated 7 years ago
- Implementation of Meta-RL A3C algorithm☆407Feb 22, 2017Updated 8 years ago
- pytorch implementation of Curiosity-driven Exploration by Self-supervised Prediction☆80Jan 5, 2019Updated 7 years ago
- Reinforcement learning with unsupervised auxiliary tasks☆423Feb 13, 2019Updated 7 years ago
- Implementations of deep RL papers and random experimentation☆178Apr 7, 2018Updated 7 years ago
- rllab is a framework for developing and evaluating reinforcement learning algorithms, fully compatible with OpenAI Gym.☆3,040Jun 10, 2023Updated 2 years ago
- Gym wrapper for Vizdoom environments☆12Dec 14, 2018Updated 7 years ago
- Implement A3C for Mujoco gym envs☆73Nov 2, 2017Updated 8 years ago
- Evolution Strategies in PyTorch☆354Sep 11, 2017Updated 8 years ago
- Pytorch implementation of DeepMind's differentiable neural computer paper.☆93Dec 4, 2017Updated 8 years ago
- Pytorch LSTM RNN for reinforcement learning to play Atari games from OpenAI Universe. We also use Google Deep Mind's Asynchronous Advanta…☆195Sep 19, 2024Updated last year
- lagom: A PyTorch infrastructure for rapid prototyping of reinforcement learning algorithms.☆378Nov 19, 2022Updated 3 years ago
- Open source implementation of the PAAC algorithm presented in Efficient Parallel Methods for Deep Reinforcement Learning☆201Jun 3, 2017Updated 8 years ago
- Noisy Networks for Exploration☆187Jan 28, 2018Updated 8 years ago
- Pytorch implementation of Distributed Proximal Policy Optimization: https://arxiv.org/abs/1707.02286☆184Mar 25, 2018Updated 7 years ago
- Accompanying repository for Let's make a DQN / A3C series.☆394Sep 4, 2018Updated 7 years ago
- Distributed A3C☆34Dec 22, 2017Updated 8 years ago
- PyTorch implementation of Value Iteration Networks (VIN): Clean, Simple and Modular. Visualization in Visdom.☆225Mar 29, 2017Updated 8 years ago
- PyTorch implementation of Trust Region Policy Optimization☆449Sep 13, 2018Updated 7 years ago
- Ape-X DQN & DDPG with pytorch & tensorboard☆102Jun 18, 2019Updated 6 years ago
- Pytorch Implementation of DQN / DDQN / Prioritized replay/ noisy networks/ distributional values/ Rainbow/ hierarchical RL☆3,159Nov 4, 2021Updated 4 years ago
- Implementation of TRPO and related algorithms☆646May 20, 2018Updated 7 years ago
- RUDDER for ATARI games with delayed rewards in OpenAI Baselines package☆268Oct 24, 2019Updated 6 years ago
- PyTorch Implementation of REINFORCE for both discrete & continuous control☆267Apr 16, 2017Updated 8 years ago