aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆161Updated 5 years ago
Alternatives and similar repositories for BEAR
Users who are interested in BEAR are comparing it to the libraries listed below.
- Code for MOPO: Model-based Offline Policy Optimization☆183Updated 3 years ago
- ☆201Updated 2 years ago
- ☆92Updated last year
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆127Updated 4 years ago
- ☆61Updated 7 years ago
- ☆113Updated 2 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆191Updated 2 years ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.☆171Updated 9 months ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction☆149Updated 3 years ago
- Code accompanying the paper "Better Exploration with Optimistic Actor Critic" (NeurIPS 2019)☆68Updated 2 years ago
- ☆274Updated 7 years ago
- Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning☆215Updated 2 years ago
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm.☆260Updated 5 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆140Updated last year
- Soft Actor-Critic☆152Updated 7 years ago
- ☆132Updated last year
- Adversarial Imitation Via Variational Inverse Reinforcement Learning☆95Updated 5 years ago
- Curiosity-driven Exploration by Self-supervised Prediction☆139Updated 2 years ago
- ☆75Updated last year
- Model-Based Offline Reinforcement Learning☆51Updated 4 years ago
- A PyTorch reproduction of the model-based reinforcement learning algorithm MBPO☆179Updated 3 years ago
- Pytorch implementation of "FeUdal Networks for Hierarchical Reinforcement Learning" for Montezuma's Revenge☆96Updated 3 years ago
- Implementation of VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-Learning - Zintgraf et al. (ICLR 2020)☆192Updated 2 years ago
- DHER: Hindsight Experience Replay for Dynamic Goals (ICLR-2019)☆66Updated 5 years ago
- ☆99Updated 2 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020)☆103Updated 3 years ago
- ☆42Updated 3 years ago
- Code accompanying NeurIPS 2019 paper: "Distributional Policy Optimization - An Alternative Approach for Continuous Control"☆22Updated 5 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆73Updated 2 years ago
- Curriculum-guided Hindsight Experience Replay (NeurIPS-2019)☆65Updated 5 years ago