MishaLaskin / curlLinks
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
☆594Updated 5 years ago
Alternatives and similar repositories for curl
Users that are interested in curl are comparing it to the libraries listed below
Sorting:
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆504Updated 3 years ago
- DrQ: Data regularized Q☆418Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination☆567Updated 4 years ago
- RAD: Reinforcement Learning with Augmented Data☆416Updated 4 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆518Updated 3 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆558Updated 2 years ago
- PyTorch implementation of Soft Actor-Critic (SAC)☆571Updated 4 years ago
- ☆398Updated 6 years ago
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆372Updated 4 years ago
- ☆357Updated 3 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆466Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆319Updated last year
- Random Network Distillation pytorch☆256Updated 6 years ago
- This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.☆439Updated 3 years ago
- Code for conservative Q-learning☆464Updated 4 years ago
- ☆273Updated 7 years ago
- PyTorch implementation of Soft Actor-Critic + Autoencoder(SAC+AE)☆253Updated 5 years ago
- Code for the paper "Phasic Policy Gradient"☆267Updated 2 years ago
- Code for "Unsupervised State Representation Learning in Atari"☆255Updated 2 years ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆206Updated 3 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆266Updated 5 years ago
- Tools for accelerating safe exploration research.☆573Updated 2 years ago
- ☆202Updated 2 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆224Updated last year
- PyTorch implementation of Trust Region Policy Optimization☆450Updated 7 years ago
- DQN Zoo is a collection of reference implementations of reinforcement learning agents developed at DeepMind based on the Deep Q-Network (…☆490Updated last year
- Multi Task RL Baselines☆257Updated 3 years ago
- Multitask Environments for RL☆280Updated 4 years ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆703Updated last year
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆649Updated 4 years ago