Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆163Jul 17, 2020Updated 5 years ago
Alternatives and similar repositories for BEAR
Users that are interested in BEAR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆660Apr 6, 2021Updated 4 years ago
- Code for conservative Q-learning☆476Dec 7, 2021Updated 4 years ago
- ☆203Mar 25, 2023Updated 3 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- Implementation of advantage-weighted regression.☆209May 30, 2020Updated 5 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆190May 17, 2022Updated 3 years ago
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆560Jun 26, 2023Updated 2 years ago
- A collection of reference environments for offline reinforcement learning☆1,663Nov 18, 2024Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆400Dec 18, 2021Updated 4 years ago
- Collection of reinforcement learning algorithms☆2,884Jun 17, 2024Updated last year
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆537Nov 22, 2022Updated 3 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Jul 26, 2019Updated 6 years ago
- Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)☆80Aug 14, 2022Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- OptiDICE: Offline Policy Optimization via Stationary Distribution Correction Estimation☆16Aug 3, 2023Updated 2 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆131Mar 21, 2021Updated 5 years ago
- Reinforcement Learning with Deep Energy-Based Policies☆436Nov 28, 2023Updated 2 years ago
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,059May 23, 2024Updated last year
- Implementation of "Sample-Efficient Deep Reinforcement Learning via Episodic Backward Update", NeurIPS 2019.☆16Sep 24, 2019Updated 6 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆599Oct 28, 2020Updated 5 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Feb 10, 2022Updated 4 years ago
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,044Jul 14, 2023Updated 2 years ago
- An offline deep reinforcement learning library☆1,648Sep 10, 2025Updated 6 months ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- ☆44Sep 19, 2021Updated 4 years ago
- Conservative Q Learning on top of SAC☆138Oct 15, 2022Updated 3 years ago
- [NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"☆57Apr 6, 2023Updated 2 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…☆1,416Nov 29, 2023Updated 2 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆76Mar 16, 2023Updated 3 years ago
- This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning.(C51-DDPG)☆11Sep 14, 2017Updated 8 years ago
- Real-World RL Benchmark Suite☆365Aug 11, 2020Updated 5 years ago
- ☆26Mar 16, 2023Updated 3 years ago
- PyTorch implementation of Distribution Correction(DisCor) based on Soft Actor-Critic.☆38Jun 22, 2022Updated 3 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- ICML 2018 Self-Imitation Learning☆277Apr 18, 2020Updated 5 years ago
- ☆399Jul 18, 2019Updated 6 years ago