akshaykhadse / reinforcement-learningLinks

Implementations of basic concepts dealt under the Reinforcement Learning umbrella. This project is collection of assignments in CS747: Foundations of Intelligent and Learning Agents (Autumn 2017) at IIT Bombay

☆17

Alternatives and similar repositories for reinforcement-learning

Users that are interested in reinforcement-learning are comparing it to the libraries listed below

Sorting:

ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆56Updated 6 years ago
jianing-sun / Interpolated-Policy-Gradient-with-PPO-for-Robotics-Control-
Reinforcement Learning for robotics continuous control, mainly based on Proximal Policy Optimization, extending to Interpolated Policy Gr…
☆37Updated 6 years ago
jonasrothfuss / model_ensemble_meta_learning
Implementation of the Model-Based Meta-Policy-Optimization (MB-MPO) algorithm
☆44Updated 6 years ago
zafarali / emdp
Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations
☆49Updated 3 years ago
PKU-RL / FEN
FEN Code
☆38Updated 5 years ago
zuoxingdong / DeepPILCO
☆54Updated 7 years ago
cjm715 / mgym
A collection of multi-agent reinforcement learning OpenAI gym environments
☆45Updated 5 years ago
ewanlee / ICLR2019-RL-Papers
The Reinforcement-Learning-Related Papers of ICLR 2019
☆47Updated 6 years ago
behaviorguidedRL / BGRL
Open source demo for the paper Learning to Score Behaviors for Guided Policy Optimization
☆24Updated 5 years ago
hejia-zhang / awesome-model-based-reinforcement-learning
A curated list of awesome Model-based reinforcement learning resources
☆94Updated 4 years ago
RobertTLange / spinningup-workspace
Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.
☆38Updated 2 years ago
jinming99 / DGP-IRL
Deep Gaussian Process for Inverse Reinforcement Learning
☆33Updated 7 years ago
gopala-kr / DRL-Agents
research and implementations of Deep RL agents and their applications
☆51Updated 3 weeks ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆28Updated 6 years ago
thanard / me-trpo
☆92Updated last year
hari-sikchi / safeRL
Safe Reinforcement Learning algorithms
☆74Updated 2 years ago
wyjung0625 / p3s
Implementation of Population-Guided Parallel Policy Search for Reinforcement Learning
☆22Updated 5 years ago
tesslerc / ActionRobustRL
Code accompanying the paper "Action Robust Reinforcement Learning and Applications in Continuous Control" https://arxiv.org/abs/1901.0918…
☆44Updated 6 years ago
befelix / SafeMDP
Safe exploration in Markov Decision Processes
☆37Updated 7 years ago
louaaron / GAN-Q-Learning
Unofficial Implementation of GAN Q Learning https://arxiv.org/abs/1805.04874
☆47Updated 4 years ago
HumanCompatibleAI / population-irl
(Experimental) Inverse reinforcement learning from trajectories generated by multiple agents with different (but correlated) rewards
☆28Updated 6 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26Updated 5 years ago
rems75 / SPIBB-DQN
Code for SPIBB-DQN and Soft-SPIBB-DQN
☆11Updated 5 years ago
Miffyli / rl-action-space-shaping
Experiment code for testing effect of various action space transformations in reinforcement learning
☆30Updated 5 years ago
eugenevinitsky / robust_RL_multi_adversary
We investigate the effect of populations on finding good solutions to the robust MDP
☆28Updated 4 years ago
LihaoR / Entropy-Regularized-RL
soft q learning and soft actor critic
☆15Updated 6 years ago
hari-sikchi / offline_rl
Pytorch implementation of state-of-the-art offline reinforcement learning algorithms.
☆23Updated 2 years ago
fshamshirdar / pytorch-rdpg
PyTorch Implementation of the RDPG (Recurrent Deterministic Policy Gradient)
☆55Updated 2 years ago
ajlangley / trpo-pytorch
An implementation of TRPO with GAE in PyTorch
☆16Updated last year
Santara / stochastic_value_gradient
Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]
☆26Updated 3 years ago