dgriff777 / rl_a3c_pytorch
A3C LSTM Atari with Pytorch plus A3G design
☆563Updated last year
Related projects: ⓘ
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,223Updated 4 years ago
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,091Updated 3 years ago
- Accompanying repository for Let's make a DQN / A3C series.☆392Updated 6 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆307Updated 3 years ago
- Hybrid CPU/GPU implementation of the A3C algorithm for deep reinforcement learning.☆651Updated 4 years ago
- Deep Q-Learning Network in pytorch (not actively maintained)☆384Updated 6 years ago
- Simple A3C implementation with pytorch + multiprocessing☆607Updated last year
- Trust Region Policy Optimization with TensorFlow and OpenAI Gym☆360Updated 4 years ago
- Rainbow: Combining Improvements in Deep Reinforcement Learning☆1,565Updated 2 years ago
- This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…☆662Updated 3 years ago
- PyTorch implementation of Trust Region Policy Optimization☆431Updated 6 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…☆1,197Updated 9 months ago
- Contains high quality implementations of Deep Reinforcement Learning algorithms written in PyTorch☆1,052Updated 3 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆411Updated 9 months ago
- Implementation of TRPO and related algorithms☆617Updated 6 years ago
- This repository contains model-free deep reinforcement learning algorithms implemented in Pytorch☆440Updated 5 years ago
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆563Updated 6 years ago
- PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.☆389Updated 3 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆366Updated 5 years ago
- Code for the paper "Exploration by Random Network Distillation"☆873Updated 3 years ago
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆258Updated 5 months ago
- PyTorch Agent Net: reinforcement learning toolkit for pytorch☆528Updated 2 months ago
- Actor-critic with experience replay☆251Updated last year
- Prioritized Experience Replay (PER) implementation in PyTorch☆302Updated 4 years ago
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…☆272Updated 6 years ago
- Reimplementation of DDPG(Continuous Control with Deep Reinforcement Learning) based on OpenAI Gym + Tensorflow☆550Updated 2 years ago
- Implementation of Meta-RL A3C algorithm☆401Updated 7 years ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆643Updated 4 months ago
- Code for the paper "Generative Adversarial Imitation Learning"☆685Updated 5 years ago
- Deep Reinforcement Learning with pytorch & visdom☆798Updated 4 years ago