pathak22 / exploration-by-disagreement
[ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement
☆123Updated 5 years ago
Related projects ⓘ
Alternatives and complementary repositories for exploration-by-disagreement
- Code for reproducing experiments in Model-Based Active Exploration, ICML 2019☆78Updated 5 years ago
- rllab's viskit with some added features☆73Updated last year
- [NeurIPS 2019] Code for the paper "Learning to Control Self-Assembling Morphologies: A Study of Generalization via Modularity"☆112Updated 4 years ago
- Modifiable OpenAI Gym environments for studying generalization in RL☆86Updated 5 years ago
- This repository contains code for the method and experiments of the paper "Learning with AMIGo: Adversarially Motivated Intrinsic Goals".☆61Updated last year
- Pytorch implementation of LOLA (https://arxiv.org/abs/1709.04326) using DiCE (https://arxiv.org/abs/1802.05098)☆90Updated 6 years ago
- Official implementation of ICML paper Imitating Latent Policies from Observation☆73Updated 5 years ago
- Deep Variational Reinforcement Learning☆134Updated 2 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆149Updated 4 years ago
- Efficient Exploration via State Marginal Matching (2019)☆66Updated 5 years ago
- Automatic Data-Regularized Actor-Critic (Auto-DrAC)☆101Updated last year
- State Representation Learning (SRL) zoo with PyTorch - Part of S-RL Toolbox☆162Updated 5 years ago
- Building Agents with Imagination: pytorch step-by-step implementation☆205Updated 5 years ago
- Modular multitask reinforcement learning with policy sketches☆105Updated 3 years ago
- This is the pytorch implementation of ICML 2018 paper - Self-Imitation Learning.☆65Updated 6 years ago
- A job launching library for docker, EC2, GCP, etc.☆57Updated 3 years ago
- Code for "Divide-and-Conquer Reinforcement Learning"☆60Updated 5 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Updated 5 years ago
- Hindsight policy gradients☆43Updated 4 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithm. Includes a useful experiment framework for Me…☆232Updated 2 years ago
- third person imitation learning. Archival only.☆77Updated 5 years ago
- ☆41Updated 6 years ago
- Deep Reinforcement Learning algorithms implemented in PyTorch☆49Updated 6 years ago
- Revisiting Rainbow☆73Updated 3 years ago
- Entity Abstraction in Visual Model-Based Reinforcement Learning☆55Updated 3 years ago
- ☆97Updated last year
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆68Updated last year
- ☆130Updated 5 years ago
- Soft Actor-Critic☆141Updated 6 years ago