vy007vikas / PyTorch-ActorCriticRL
PyTorch implementation of DDPG algorithm for continuous action reinforcement learning problem.
☆389Updated 3 years ago
Related projects: ⓘ
- Implementation of the Deep Deterministic Policy Gradient (DDPG) using PyTorch☆563Updated 6 years ago
- A pytorch implementation of MADDPG (multi-agent deep deterministic policy gradient)☆605Updated 6 years ago
- PyTorch implementations of various Deep Reinforcement Learning (DRL) algorithms for both single agent and multi-agent.☆524Updated 6 years ago
- Implementation of algorithms for continuous control (DDPG and NAF).☆307Updated 3 years ago
- Simple A3C implementation with pytorch + multiprocessing☆607Updated last year
- Vanilla DQN, Double DQN, and Dueling DQN implemented in PyTorch☆428Updated 6 years ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆302Updated 4 years ago
- PyTorch implementation of Trust Region Policy Optimization☆431Updated 6 years ago
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,091Updated 3 years ago
- Code for "Actor-Attention-Critic for Multi-Agent Reinforcement Learning" ICML 2019☆664Updated 2 years ago
- Repo containing code for multi-agent deep reinforcement learning (MADRL).☆653Updated last year
- PyTorch Implementation of MADDPG (Lowe et. al. 2017)☆552Updated 4 years ago
- Deep Q-Learning Network in pytorch (not actively maintained)☆384Updated 6 years ago
- Mean Field Multi-Agent Reinforcement Learning☆374Updated 4 years ago
- This repository contains most of pytorch implementation based classic deep reinforcement learning algorithms, including - DQN, DDQN, Duel…☆662Updated 3 years ago
- Reimplementation of DDPG(Continuous Control with Deep Reinforcement Learning) based on OpenAI Gym + Tensorflow☆550Updated 2 years ago
- PyTorch implementation of Asynchronous Advantage Actor Critic (A3C) from "Asynchronous Methods for Deep Reinforcement Learning".☆1,223Updated 4 years ago
- Policy Gradient algorithms (REINFORCE, NPG, TRPO, PPO)☆366Updated 5 years ago
- PyTorch implementation of soft actor critic☆796Updated 2 years ago
- Actor-critic with experience replay☆251Updated last year
- Continuous control with deep reinforcement learning - Deep Deterministic Policy Gradient (DDPG) algorithm implemented in OpenAI Gym envir…☆272Updated 6 years ago
- Constrained Policy Optimization☆305Updated 7 years ago
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆471Updated last year
- A3C LSTM Atari with Pytorch plus A3G design☆563Updated last year
- Implementation of benchmark RL algorithms☆458Updated 2 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆588Updated 3 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆492Updated 2 years ago
- Reinforcement Learning with Model-Agnostic Meta-Learning in Pytorch☆816Updated last year
- A continuous action space version of A3C LSTM in pytorch plus A3G design☆258Updated 5 months ago
- Code for hierarchical imitation learning and reinforcement learning☆282Updated 6 years ago