adik993 / reinforcement-learning-suttonLinks

☆15

Alternatives and similar repositories for reinforcement-learning-sutton

Users that are interested in reinforcement-learning-sutton are comparing it to the libraries listed below

Sorting:

montrealrobotics / active-domainrand
Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)
☆98Updated 4 years ago
montrealrobotics / domain-randomizer
A standalone library to randomize various OpenAI Gym Environments
☆63Updated 5 years ago
mengf1 / DHER
DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)
☆66Updated 5 years ago
cjm715 / mgym
A collection of multi-agent reinforcement learning OpenAI gym environments
☆45Updated 4 years ago
junjungoal / IMPALA-pytorch
PyTorch IMPALA implementation
☆26Updated 5 years ago
msinto93 / D4PG
Tensorflow implementation of a Deep Distributed Distributional Deterministic Policy Gradients (D4PG) network, trained on OpenAI Gym envir…
☆125Updated 5 years ago
maximilianigl / DVRL
Deep Variational Reinforcement Learning
☆136Updated 2 years ago
WilsonWangTHU / POPLIN
☆99Updated 2 years ago
RomainLaroche / SPIBB
Safe Policy Improvement with Baseline Bootstrapping
☆26Updated 5 years ago
aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆160Updated 4 years ago
localminimum / hindsight-experience-replay
Hindsight Experience Replay - Bit flipping experiment in Tensorflow
☆58Updated 6 years ago
implementation-matters / code-for-paper
☆111Updated 5 years ago
tdavchev / option-critic
A Tensorflow implementation of the Option-Critic Architecture
☆72Updated 8 years ago
ewanlee / ICLR2019-RL-Papers
The Reinforcement-Learning-Related Papers of ICLR 2019
☆47Updated 6 years ago
yfletberliac / adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
☆49Updated 3 years ago
tesslerc / GAC
Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"
☆22Updated 5 years ago
Hwhitetooth / lirpg
☆61Updated 6 years ago
YuhangSong / Arena-BuildingToolkit
Arena: A General Evaluation Platform and Building Toolkit for Single/Multi-Agent Intelligence. AAAI 2020.
☆83Updated 4 years ago
ben-eysenbach / sac
Soft Actor-Critic
☆147Updated 7 years ago
mcgillmrl / prob_mbrl
A library of probabilistic model based RL algorithms in pytorch
☆107Updated 4 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆28Updated 6 years ago
YuejiangLIU / prioritized_option_critic
Implementation of the Prioritized Option-Critic on the Four-Rooms Environment
☆16Updated 7 years ago
kushagra06 / SAC
Pytorch implementation of Soft Actor-Critic
☆19Updated 5 years ago
johannah / bootstrap_dqn
Implementation of Bootstrap DQN and Randomized Prior Functions on ALE
☆55Updated 2 months ago
hermesdt / reinforcement-learning
☆39Updated 5 years ago
mila-iqia / teamgrid
Multiagent gridworld for the TEAM project based on gym-minigrid
☆12Updated 5 years ago
rlai-lab / Regularized-GradientTD
Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.
☆38Updated 4 years ago
thanard / me-trpo
☆91Updated last year
nikhilbarhate99 / TD3-PyTorch-BipedalWalker-v2
Twin Delayed DDPG (TD3) PyTorch solution for Roboschool and Box2d environment
☆106Updated 6 years ago
RobertTLange / spinningup-workspace
Reading notes & PyTorch experiments on OpenAI's "Spinning Up in DRL" tutorial.
☆38Updated 2 years ago