yc930401 / Actor-Critic-pytorchLinks

Actor Critic model to play Cartpole game

☆52

Alternatives and similar repositories for Actor-Critic-pytorch

Users that are interested in Actor-Critic-pytorch are comparing it to the libraries listed below

Sorting:

deligentfool / dqn_zoo
The implement of all kinds of dqn reinforcement learning with Pytorch
☆93Updated 4 years ago
Bigpig4396 / PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA
☆76Updated 5 years ago
nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…
☆71Updated 5 years ago
skumar9876 / Hierarchical-DQN
Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…
☆86Updated 7 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆99Updated 5 years ago
gouxiangchen / dueling-DQN-pytorch
very easy implementation of dueling DQN in pytorch
☆72Updated 2 years ago
DKuan / MADDPG_torch
The code for maddpg using pytorch
☆169Updated 4 years ago
cyanrain7 / TRPO-in-MARL
☆211Updated 2 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆142Updated 6 years ago
MadryLab / implementation-matters
☆131Updated 11 months ago
keep9oing / DRQN-Pytorch-CartPole-v1
Deep recurrent Q learning on CartPole-v1 environment
☆91Updated last year
toshikwa / sac-discrete.pytorch
PyTorch implementation of SAC-Discrete.
☆305Updated 11 months ago
nikhilbarhate99 / Hierarchical-Actor-Critic-HAC-PyTorch
PyTorch implementation of Hierarchical Actor Critic (HAC) for OpenAI gym environments
☆317Updated 3 years ago
ChangyWen / wolpertinger_ddpg
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…
☆66Updated 2 years ago
Kchu / DeepRL_PyTorch
Deep Reinforcement Learning codes for study. Currently, there are only codes for algorithms: DQN, C51, QR-DQN, IQN, QUOTA.
☆213Updated 2 years ago
dhruvramani / Transformers-RL
An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"
☆180Updated 2 years ago
LxzGordon / Deep-Reinforcement-Learning-with-pytorch
Basic reinforcement learning algorithms. Including:DQN,Double DQN, Dueling DQN, SARSA, REINFORCE, baseline-REINFORCE, Actor-Critic,DDPG,D…
☆92Updated 4 years ago
amazon-science / meta-q-learning
Code for the paper "Meta-Q-Learning"( ICLR 2020)
☆103Updated 3 years ago
MoMe36 / BranchingDQN
BranchingDQN
☆50Updated 6 years ago
cycraig / MP-DQN
Source code for the dissertation: "Multi-Pass Deep Q-Networks for Reinforcement Learning with Parameterised Action Spaces"
☆218Updated 6 years ago
choru-k / Reinforcement-Learning-Pytorch-Cartpole
Simple Cartpole example writed with pytorch.
☆169Updated 5 years ago
vwxyzjn / invalid-action-masking
Source Code for A Closer Look at Invalid Action Masking in Policy Gradient Algorithms
☆155Updated 2 years ago
zhangchuheng123 / Reinforcement-Implementation
Implementation of benchmark RL algorithms
☆467Updated 2 years ago
RchalYang / torchrl
Pytorch Implementation of Reinforcement Learning Algorithms ( Soft Actor Critic(SAC)/ DDPG / TD3 /DQN / A2C/ PPO / TRPO)
☆222Updated 2 years ago
XinJingHao / PPO-Continuous-Pytorch
A clean and robust Pytorch implementation of PPO on continuous action space.
☆154Updated last year
XinJingHao / PPO-Discrete-Pytorch
A clean and robust Pytorch implementation of PPO on Discrete action space
☆70Updated last year
RunzheYang / MORL
Multi-Objective Reinforcement Learning
☆277Updated 3 years ago
rlcode / per
Prioritized Experience Replay (PER) implementation in PyTorch
☆344Updated 5 years ago
lweitkamp / option-critic-pytorch
PyTorch implementation of the Option-Critic framework, Harb et al. 2016
☆128Updated 10 months ago
nikhilbarhate99 / Actor-Critic-PyTorch
Policy Gradient Actor-Critic PyTorch | Lunar Lander v2
☆73Updated 6 years ago