mluogh / reinforcement

Learning to Reinforcement Learn

☆11

Alternatives and similar repositories for reinforcement:

Users who are interested in reinforcement are comparing it to the repositories listed below.

abhishm / PGQ
PGQ is an approach to combine Policy Gradient and Q-Learning. This repository will contain an implementation of PGQ.
☆15 Updated 7 years ago
DartML / PPO-Stein-Control-Variate
Proximal Policy Optimization with Stein Control Variates
☆33 Updated 7 years ago
marcino239 / pilco
Using the PILCO algorithm to find a controller for a few robotic problems
☆43 Updated 9 years ago
roosephu / slbo
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees
☆55 Updated 5 years ago
jvmncs / ParamNoise
A comparison of parameter space noise methods for exploration in deep reinforcement learning
☆27 Updated 5 years ago
tmoer / multimodal_varinf
Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning".
☆35 Updated 6 years ago
martinseilair / dm_control2gym
OpenAI Gym Wrapper for DeepMind Control Suite
☆72 Updated 3 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49 Updated 6 years ago
kindredresearch / arp
Autoregressive policies for continuous control reinforcement learning
☆29 Updated 5 years ago
aravindr93 / robustRL
Robust policy search algorithms which train on model ensembles
☆28 Updated 8 years ago
wensun / Imitation-Learning-from-Observation
☆23 Updated last year
andrewliao11 / pytorch-a3c-mujoco
Implementation of A3C for MuJoCo Gym environments
☆72 Updated 7 years ago
quanvuong / Supervised_Policy_Update
Code to reproduce Supervised Policy Update (ICLR 2019)
☆17 Updated 2 years ago
mfornet / hindsight-experience-replay
Implementation of HER algorithm in the bit-flipping environment.
☆17 Updated 7 years ago
YuhangSong / DEHRL
Diversity-Driven Extensible Hierarchical Reinforcement Learning. AAAI 2019.
☆48 Updated 5 years ago
rddy / isql
Inferring beliefs about dynamics from behavior
☆28 Updated 6 years ago
dibyaghosh / dnc
Code for "Divide-and-Conquer Reinforcement Learning"
☆61 Updated 6 years ago
NoListen / ERL
Exploration-based Reinforcement Learning (Montezuma's Revenge)
☆14 Updated 6 years ago
Feryal / a3c-mujoco
☆28 Updated 7 years ago
AdamStelmaszczyk / learning2run
Our NIPS 2017: Learning to Run source code
☆55 Updated last year
Breakend / DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
☆151 Updated 7 years ago
go2sea / C51DQN
A TensorFlow implementation of DeepMind's "A Distributional Perspective on Reinforcement Learning" (C51-DQN).
☆56 Updated 7 years ago
pfnet-research / capg
Implementation of clipped action policy gradient (CAPG) with PPO and TRPO
☆31 Updated 6 years ago
youngwoon / transition
Official code for the paper "Learning Transition Policies for Composing Complex Skills" (ICLR 2019)
☆74 Updated 5 years ago
EndingCredits / Neural-Episodic-Control
Implementation of DeepMind's Neural Episodic Control
☆58 Updated 6 years ago
jacobandreas / psketch
Modular multitask reinforcement learning with policy sketches
☆108 Updated 3 years ago
ilyasu123 / trpo
☆19 Updated 8 years ago
kimhc6028 / pytorch-noreward-rl
PyTorch implementation of Curiosity-driven Exploration by Self-supervised Prediction
☆79 Updated 6 years ago
zuoxingdong / DeepPILCO
☆53 Updated 7 years ago
ermongroup / CalibratedModelBasedRL
Code for "Calibrated Model-Based Deep Reinforcement Learning", ICML 2019.
☆55 Updated 5 years ago