MishaLaskin / curl
CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning
☆569Updated 3 years ago
Related projects: ⓘ
- Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)☆471Updated last year
- PyTorch implementation of Soft Actor-Critic (SAC)☆493Updated 2 years ago
- Dream to Control: Learning Behaviors by Latent Imagination☆506Updated 3 years ago
- DrQ: Data regularized Q☆401Updated last year
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆526Updated last year
- Deep Planning Network: Control from pixels by latent planning with learned dynamics☆359Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆475Updated last year
- ☆382Updated 5 years ago
- Code for conservative Q-learning☆393Updated 2 years ago
- ☆328Updated last year
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆424Updated last year
- A PyTorch Platform for Distributed RL☆737Updated 3 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆277Updated 8 months ago
- Prioritized Experience Replay (PER) implementation in PyTorch☆302Updated 4 years ago
- Random Network Distillation pytorch☆239Updated 5 years ago
- RAD: Reinforcement Learning with Augmented Data☆400Updated 3 years ago
- Code for the paper "Phasic Policy Gradient"☆245Updated last year
- PyTorch implementation of Trust Region Policy Optimization☆431Updated 6 years ago
- Tools for accelerating safe exploration research.☆495Updated last year
- Code for "Unsupervised State Representation Learning in Atari"☆239Updated 10 months ago
- Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO☆191Updated last year
- Reimplementation of World-Models (Ha and Schmidhuber 2018) in pytorch☆558Updated 2 years ago
- PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) and Generative Adversarial Imitation Lear…☆1,092Updated 3 years ago
- This is the pytorch implementation of Hindsight Experience Replay (HER) - Experiment on all fetch robotic environments.☆392Updated 2 years ago
- A curated list of awesome imitation learning resources and publications☆511Updated 7 months ago
- A collection of reference environments for offline reinforcement learning☆1,290Updated 7 months ago
- RL starter files in order to immediately train, visualize and evaluate an agent without writing any line of code☆644Updated 4 months ago
- DrQ-v2: Improved Data-Augmented Reinforcement Learning☆344Updated 2 years ago
- Multi Task RL Baselines☆221Updated 2 years ago
- PyTorch implementation of deep reinforcement learning algorithms☆485Updated 2 years ago