p-morais / deep-rl

PyTorch-based Python library for continuous reinforcement learning and imitation learning [superseded by @osudrl/apex]

☆13

Related projects:

facebookresearch / reward-estimator-corl
Reward Estimation for Variance Reduction in Deep Reinforcement Learning
☆21Updated 5 years ago
facebookresearch / modeling_long_term_future
Code for the ICLR 2019 paper "Learning Dynamics Model by Incorporating the Long Term Future"
☆51Updated 5 years ago
rddy / isql
Inferring beliefs about dynamics from behavior
☆28Updated 6 years ago
martinseilair / learningoptimalcontrol
Great resources for learning optimal control
☆16Updated 5 years ago
ofirnachum / models
Models built with TensorFlow
☆25Updated 5 years ago
edbeeching / 3d_control_deep_rl
Baselines and memory-based scenarios for the ViZDoom simulator
☆33Updated last year
stelzner / Visual-Interaction-Networks
A PyTorch implementation of visual interaction networks
☆12Updated 5 years ago
YuhangSong / Arena-Baselines-Depreciated
☆35Updated this week
Santara / RAIL
Codebase of Santara et al., RAIL: Risk Averse Imitation Learning, published in AAMAS 2018
☆14Updated 2 years ago
flowersteam / Unsupervised_Goal_Space_Learning
Code to reproduce the results in the paper "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"
☆21Updated 6 years ago
google-research / policy-learning-landscape
Explore the optimization landscape for direct policy learning in reinforcement learning.
☆50Updated 5 years ago
kindredresearch / arp
Autoregressive policies for continuous control reinforcement learning
☆28Updated 5 years ago
yuqingd / sim2real2sim_rad
☆53Updated 2 years ago
HumanCompatibleAI / rlsp
Reward Learning by Simulating the Past
☆43Updated 5 years ago
Santara / stochastic_value_gradient
Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142)
☆25Updated 2 years ago
nosyndicate / pytorchrl
Deep Reinforcement Learning algorithms implemented in PyTorch
☆49Updated 6 years ago
supratikp / HOOF
Implementation of "Fast Efficient Hyperparameter Tuning for Policy Gradient Methods" (https://arxiv.org/abs/1902.06583)
☆17Updated 4 years ago
ArmaanSethi / Hindsight-Experience-Replay-and-Hierarchical-Reinforcement-Learning
Comp 781 Project
☆8Updated 5 years ago
seungjaeryanlee / rl-exploration
Reinforcement Learning papers on exploration methods.
☆20Updated 3 years ago
KMarino / hrl-ep3
Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies
☆15Updated 5 years ago
Feryal / craft-env
☆44Updated 5 years ago
zuoxingdong / DeepPILCO
☆54Updated 6 years ago
cjm715 / mgym
A collection of multi-agent reinforcement learning OpenAI gym environments
☆44Updated 4 years ago
facebookresearch / ddr
Decoupling Dynamics and Reward for Transfer Learning
☆16Updated 6 years ago
mschulth / rhc
Implementation of the Receding Horizon Curiosity algorithm
☆13Updated last year
ikostrikov / pytorch-twin-sac
☆16Updated this week
NeurEXT / NEXT-learning-to-plan
☆21Updated 4 years ago
AdeelMufti / RL-RND
Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation
☆30Updated 5 years ago
david-abel / rl_info_theory
A collection of code investigating the use of information theory for abstractions in RL
☆15Updated 5 years ago
gkahn13 / CAPs
☆32Updated 5 years ago