hyparxis / deep-rl
PyTorch-based Python library for continuous reinforcement learning and imitation learning [superseded by @osudrl/apex]
☆13 Updated 4 years ago
Related projects
Alternatives and complementary repositories for deep-rl
- Inferring beliefs about dynamics from behavior ☆28 Updated 6 years ago
- Baselines and memory-based scenarios for the ViZDoom simulator ☆33 Updated last year
- Models built with TensorFlow ☆25 Updated 5 years ago
- Reward Estimation for Variance Reduction in Deep Reinforcement Learning ☆21 Updated 6 years ago
- Code to reproduce the results in the paper "Unsupervised Learning of Goal Spaces for Intrinsically Motivated Exploration" ☆21 Updated 6 years ago
- Code for the ICLR 2019 paper "Learning Dynamics Model by Incorporating the Long Term Future" ☆50 Updated 5 years ago
- Great resources for learning optimal control ☆17 Updated 5 years ago
- Simplified version of "State Representation Learning for Control: An Overview" bibliography ☆34 Updated 5 years ago
- RL framework for embodied agents based on PyTorch ☆12 Updated 5 years ago
- ☆44 Updated 5 years ago
- Autoregressive policies for continuous control reinforcement learning ☆28 Updated 5 years ago
- A collection of multi-agent reinforcement learning OpenAI gym environments ☆44 Updated 4 years ago
- Codebase of Santara et al., "RAIL: Risk Averse Imitation Learning", published in AAMAS 2018 ☆14 Updated 2 years ago
- Implementation of Neural Episodic Control in TensorFlow ☆26 Updated 5 years ago
- E2C implementation in PyTorch ☆43 Updated 7 years ago
- Implementation of Advantage Actor-Critic (A2C) and Proximal Policy Optimization (PPO) that takes advantage of TensorFlow 2.x ☆9 Updated 4 years ago
- Code for paper "Learning Multimodal Transition Dynamics for Model-Based Reinforcement Learning". ☆33 Updated 6 years ago
- Explore the optimization landscape for direct policy learning reinforcement learning. ☆50 Updated 5 years ago
- Full Chainer implementation of OpenAI's Reinforcement Learning using Random Network Distillation ☆30 Updated 5 years ago
- Implementation of "Learning Continuous Control Policies by Stochastic Value Gradients" (https://arxiv.org/abs/1510.09142) ☆25 Updated 2 years ago
- Implementation of "Fast Efficient Hyperparameter Tuning for Policy Gradient Methods" (https://arxiv.org/abs/1902.06583) ☆17 Updated 5 years ago
- ☆21 Updated 5 years ago
- ☆47 Updated 4 years ago
- A PyTorch implementation of visual interaction networks ☆12 Updated 5 years ago
- Code accompanying the OptionGAN paper. ☆43 Updated 6 years ago
- PyTorch implementation of Memory Augmented Self-Play ☆50 Updated 4 years ago
- Implementation of Receding Horizon Curiosity Algorithm ☆13 Updated last year
- Deep Reinforcement Learning algorithms implemented in PyTorch ☆49 Updated 6 years ago