sinaghiassian / OffpolicyAlgorithmsLinks
☆23Updated 3 years ago
Alternatives and similar repositories for OffpolicyAlgorithms
Users that are interested in OffpolicyAlgorithms are comparing it to the libraries listed below
Sorting:
- ☆27Updated 7 months ago
- A customizable framework to create maze and gridworld environments☆268Updated 6 years ago
- ☆114Updated 2 years ago
- Convert DeepMind Control Suite to OpenAI gym environments.☆87Updated 5 years ago
- Proximal Policy Option-Critic☆25Updated 6 years ago
- Hindsight policy gradients☆45Updated 5 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆223Updated last year
- Performances of Reinforcement Learning Agents☆53Updated 5 years ago
- ☆138Updated 6 years ago
- A standalone library to randomize various OpenAI Gym Environments☆63Updated 6 years ago
- OpenAI Gym Wrapper for DeepMind Control Suite☆72Updated 3 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆193Updated 2 years ago
- rllab's viskit with some added features☆73Updated 2 years ago
- Code repository for Active Domain Randomization (CoRL 2019, https://arxiv.org/abs/1904.04762)☆101Updated 4 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Updated 6 years ago
- ☆10Updated 4 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆153Updated 5 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- ☆47Updated 5 years ago
- Episodic Control☆21Updated 3 years ago
- ☆202Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆251Updated 5 years ago
- A3C style Option-Critic with deliberation cost☆39Updated 7 years ago
- TensorFlow implementation for our paper "Exploration via Hindsight Goal Generation"☆23Updated 3 years ago
- Multitask Environments for RL☆280Updated 4 years ago
- Implementation of Tactical Optimistic and Pessimistic value estimation☆25Updated 2 years ago
- ☆13Updated 5 years ago
- Code repo for Gradient Temporal-Difference Learning with Regularized Corrections paper.☆36Updated 5 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆87Updated 6 years ago
- JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"☆102Updated 3 years ago