p-morais / deep-rl
Pytorch-based python library for continuous reinforcement learning and imitation learning [superseded by @osudrl/apex]
☆13Updated 4 years ago
Related projects: ⓘ
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning☆21Updated 5 years ago
- Code for ICLR 2019 paper Learning Dynamics Model by Incorporating the Long Term Future☆51Updated 5 years ago
- Inferring beliefs about dynamics from behavior☆28Updated 6 years ago
- Great resources for learning optimal control☆16Updated 5 years ago
- Models built with TensorFlow☆25Updated 5 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator☆33Updated last year
- A PyTorch implementation of visual interaction networks☆12Updated 5 years ago
- ☆35Updated this week
- Codebase of Santara et. al., RAIL: Risk Averse Imitation Learning, Published in AAMAS 2018☆14Updated 2 years ago
- Code to reproduce the results in the "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration"☆21Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning.☆50Updated 5 years ago
- Autoregressive policies for continuous control reinforcement learning☆28Updated 5 years ago
- ☆53Updated 2 years ago
- Reward Learning by Simulating the Past☆43Updated 5 years ago
- Implementation of (Learning Continuous Control Policies by Stochastic Value Gradients)[https://arxiv.org/abs/1510.09142]☆25Updated 2 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Implementation of the Fast Efficient Hyperparameter Tuning for Policy Gradient Methods https://arxiv.org/abs/1902.06583☆17Updated 4 years ago
- Comp 781 Project☆8Updated 5 years ago
- Reinforcement Learning papers on exploration methods.☆20Updated 3 years ago
- Code for our paper: Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies☆15Updated 5 years ago
- ☆44Updated 5 years ago
- ☆54Updated 6 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments☆44Updated 4 years ago
- Decoupling Dynamics and Reward for Transfer Learning☆16Updated 6 years ago
- Implementation of Receding Horizon Curiosity Algrithm☆13Updated last year
- ☆16Updated this week
- ☆21Updated 4 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation☆30Updated 5 years ago
- A collection of code investigating the use of information theory for abstractions in RL☆15Updated 5 years ago
- ☆32Updated 5 years ago