shivamsaboo17 / Policy-Gradient-PyTorchLinks

Implementation of vanilla stochaistic (categorical) policy gradient algorithm to play cartpole.

☆16

Alternatives and similar repositories for Policy-Gradient-PyTorch

Users that are interested in Policy-Gradient-PyTorch are comparing it to the libraries listed below

Sorting:

hungtuchen / pytorch-hdqn
Hierarchical-DQN in pytorch (not actively maintained)
☆69Updated 8 years ago
cyoon1729 / Policy-Gradient-Methods
Implementation of Algorithms from the Policy Gradient Family. Currently includes: A2C, A3C, DDPG, TD3, SAC
☆99Updated 5 years ago
skumar9876 / Hierarchical-DQN
Implementation of the paper Hierarchical Deep Reinforcement Learning: Integrating Temporal Abstraction and Intrinsic Motivation - https:/…
☆86Updated 7 years ago
rhoowd / sched_net
☆85Updated 3 years ago
cyoon1729 / Multi-agent-reinforcement-learning
Implementation of Multi-Agent Reinforcement Learning algorithm(s). Currently includes: MADDPG
☆64Updated 5 years ago
hsvgbkhgbv / SQDDPG
This is a framework for the research on multi-agent reinforcement learning and the implementation of the experiments in the paper titled …
☆119Updated 7 months ago
ml3705454 / mapr2
☆46Updated 2 years ago
ermongroup / multiagent-gail
☆83Updated 6 years ago
ASzot / ppo-pytorch
Proximal policy optimization in PyTorch. Easy to read and understand.
☆50Updated 4 years ago
sungyubkim / Deep_RL_with_pytorch
A pytorch tutorial for DRL(Deep Reinforcement Learning)
☆218Updated 2 years ago
yilunc2020 / Attention-DQN
Deep Recurrent Attention Reinforcement Learning in Atari
☆85Updated 6 years ago
PKU-RL / DGN
DGN Code
☆353Updated 2 years ago
cardwing / Codes-for-RL-PER
A novel DDPG method with prioritized experience replay (IEEE SMC 2017)
☆50Updated 6 years ago
deligentfool / dqn_zoo
The implement of all kinds of dqn reinforcement learning with Pytorch
☆93Updated 4 years ago
Bigpig4396 / PyTorch-Counterfactual-Multi-Agent-Policy-Gradients-COMA
☆76Updated 5 years ago
fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55Updated 2 years ago
gmargo11 / hDQN
Implementation of Hierarchical Deep Q-Learning (Kulkarni et al., 2016)
☆35Updated 6 years ago
amazon-science / meta-q-learning
Code for the paper "Meta-Q-Learning"( ICLR 2020)
☆103Updated 3 years ago
yufeiwang63 / RLlab
pytorch implementation of DQN, NAF, DDPG
☆13Updated 7 years ago
ChangyWen / wolpertinger_ddpg
Wolpertinger Training with DDPG (Pytorch), Deep Reinforcement Learning in Large Discrete Action Spaces. Multi-GPU/Singer-GPU/CPU compatib…
☆66Updated 2 years ago
saizhang0218 / VBC
pytorch implementation of "Efficient Communication in Multi-Agent Reinforcement Learning via Variance Based Control"
☆53Updated 2 years ago
011235813 / hierarchical-marl
Hierarchical Cooperative Multi-Agent Reinforcement Learning with Skill Discovery
☆103Updated 3 years ago
ermongroup / MA-AIRL
Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019.
☆209Updated 6 years ago
nikhil3456 / Deep-Reinforcement-Learning-in-Large-Discrete-Action-Spaces
PyTorch implementation of the paper "Deep Reinforcement Learning in Large Discrete Action Spaces" (Gabriel Dulac-Arnold, Richard Evans, H…
☆71Updated 5 years ago
namidairo777 / Distributed-MADDPG
Distributed Multi-Agent Cooperation Algorithm based on MADDPG with prioritized batch data.
☆105Updated 4 years ago
krasheninnikov / max-causal-ent-irl
Maximum Causal Entropy Inverse Reinforcement Learning
☆47Updated 6 years ago
MadryLab / implementation-matters
☆131Updated 11 months ago
adithya-subramanian / Multi_Agent_Soft_Actor_Critic
A Pytorch Implementation of Multi Agent Soft Actor Critic
☆40Updated 6 years ago
TonghanWang / NDQ
Codes accompanying the paper "Learning Nearly Decomposable Value Functions with Communication Minimization" (ICLR 2020)
☆81Updated 2 years ago
adik993 / ppo-pytorch
Proximal Policy Optimization(PPO) with Intrinsic Curiosity Module(ICM)
☆142Updated 6 years ago