aviralkumar2907 / BEAR
Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆160 Updated 4 years ago
Alternatives and similar repositories for BEAR:
Users who are interested in BEAR are comparing it to the repositories listed below:
- ☆194 Updated 2 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆177 Updated 2 years ago
- ☆91 Updated last year
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆167 Updated 3 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning ☆124 Updated 4 years ago
- Learning Invariant Representations for Reinforcement Learning without Reconstruction ☆147 Updated 3 years ago
- ☆272 Updated 6 years ago
- Learning to Adapt in Dynamic, Real-World Environments through Meta-Reinforcement Learning ☆211 Updated 2 years ago
- ☆60 Updated 6 years ago
- Conservative Q Learning on top of SAC ☆129 Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆355 Updated 3 years ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆188 Updated 2 years ago
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co… ☆135 Updated 11 months ago
- Implementation of Proximal Meta-Policy Search (ProMP) as well as related Meta-RL algorithms. Includes a useful experiment framework for Me… ☆237 Updated 2 years ago
- Code for the paper "Meta-Q-Learning" (ICLR 2020) ☆103 Updated 2 years ago
- Paper Collection for Imitation Learning in RL. ☆145 Updated 3 years ago
- ☆127 Updated 8 months ago
- Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm. ☆163 Updated 5 months ago
- Multi-Agent Adversarial Inverse Reinforcement Learning, ICML 2019. ☆207 Updated 5 years ago
- Adversarial Imitation Via Variational Inverse Reinforcement Learning ☆95 Updated 5 years ago
- ☆66 Updated 4 years ago
- PyTorch implementation of the Option-Critic framework, Harb et al. 2016 ☆126 Updated 8 months ago
- Code for conservative Q-learning ☆431 Updated 3 years ago
- ☆112 Updated last year
- This repository contains the code to implement the Hierarchical Actor-Critic (HAC) algorithm. ☆259 Updated 4 years ago
- Implementation of Bootstrap DQN and Randomized Prior Functions on ALE ☆55 Updated last month
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21) ☆75 Updated 2 years ago
- An implementation of Constrained Policy Optimization (Achiam 2017) in PyTorch ☆24 Updated 5 years ago
- ☆109 Updated 5 years ago
- PyTorch implementation of GAIL and AIRL based on PPO. ☆213 Updated 4 years ago