Riashat / Policy-Gradient-Reinforcement-Learning

☆37

Alternatives and similar repositories for Policy-Gradient-Reinforcement-Learning

Users who are interested in Policy-Gradient-Reinforcement-Learning compare it to the repositories listed below.

sparisi / td-reg
TD-Regularized Actor-Critic Methods
☆36Updated 5 years ago
wuwuwuxxx / Reinforcement-Learning-An-introduction
solutions to the examples and exercises
☆42Updated 9 years ago
wulfebw / hierarchical_rl
hierarchical deep reinforcement learning algorithms
☆41Updated 7 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆55Updated 6 years ago
jinming99 / DGP-IRL
Deep Gaussian Process for Inverse Reinforcement Learning
☆33Updated 8 years ago
hari-sikchi / safeRL
Safe Reinforcement Learning algorithms
☆74Updated 2 years ago
krasheninnikov / max-causal-ent-irl
Maximum Causal Entropy Inverse Reinforcement Learning
☆48Updated 6 years ago
HumanCompatibleAI / population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆28Updated 6 years ago
marcino239 / pilco
Using the PILCO algorithm to find a controller for a few robotic problems
☆43Updated 10 years ago
Riashat / Q-Learning-SARSA-Policy-and-Value-Iteration
Implementation of basic reinforcement learning algorithms (Q-learning, SARSA, Policy iteration and Value Iteration) on benchmark RL MDPs …
☆38Updated 9 years ago
aravindr93 / robustRL
Robust policy search algorithms which train on model ensembles
☆30Updated 8 years ago
ying-wen / malib_deprecated
A Multi-agent Learning Framework
☆62Updated 4 years ago
jangirrishabh / toyCarIRL
Implementation of Inverse Reinforcement Learning Algorithm on a toy car in a 2D world problem, (Apprenticeship Learning via Inverse Reinf…
☆176Updated 3 years ago
florensacc / snn4hrl
Stochastic Neural Networks for Hierarchical Reinforcement Learning
☆96Updated 7 years ago
zuoxingdong / DeepPILCO
☆54Updated 7 years ago
Santara / stochastic_value_gradient
Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142)
☆25Updated 3 years ago
MOCR / DDPG
Reimplementation of the DDPG algorithm using TensorFlow
☆38Updated 8 years ago
TianhongDai / distributed-ppo
This is a PyTorch implementation of Distributed Proximal Policy Optimization (DPPO).
☆62Updated 7 years ago
TakuyaHiraoka / Multi-Agent-Reinforcement-Learning-in-Stochastic-Games
Unofficial PyBrain extension for multi-agent reinforcement learning in general sum stochastic games.
☆69Updated 3 weeks ago
befelix / SafeMDP
Safe exploration in Markov Decision Processes
☆37Updated 7 years ago
dnddnjs / feudal-montezuma
Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge
☆96Updated 3 years ago
sanketloke / dlmultiagentsoccer
Deep Reinforcement Learning for Multi Agent Soccer
☆17Updated 8 years ago
mike-gimelfarb / bayesian-reward-shaping
Bayesian Reward Shaping Framework for Deep Reinforcement Learning
☆23Updated 6 years ago
vvanirudh / IRL-Toolkit
IRL Toolkit developed by Sergey Levine (Taken from https://graphics.stanford.edu/projects/gpirl/)
☆62Updated 8 years ago
EndingCredits / Neural-Episodic-Control
Implementation of Deepmind's Neural Episodic Control
☆58Updated 7 years ago
lerrel / gym-adv
Gym environments modified with adversarial agents
☆36Updated 8 years ago
tegg89 / magnet
MAGNet: Multi-agents control using Graph Neural Networks
☆132Updated 6 years ago
jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…
☆37Updated 6 years ago
xinleipan / gym-gridworld
Simple grid-world environment compatible with OpenAI-gym
☆50Updated 5 years ago
siemanko / guided-policy-search
Implementation is mostly based on Sergey Levine's work (http://www.eecs.berkeley.edu/~svlevine/).
☆43Updated 10 years ago