Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction
☆163Jul 17, 2020Updated 5 years ago
Alternatives and similar repositories for BEAR
Users that are interested in BEAR are comparing it to the libraries listed below
Sorting:
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆657Apr 6, 2021Updated 4 years ago
- ☆202Mar 25, 2023Updated 2 years ago
- Code for conservative Q-learning☆474Dec 7, 2021Updated 4 years ago
- Implementation of Random Expert Distillation☆29May 11, 2019Updated 6 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆191May 17, 2022Updated 3 years ago
- Ranking Policy Gradient☆23Nov 27, 2019Updated 6 years ago
- Implementation of advantage-weighted regression.☆208May 30, 2020Updated 5 years ago
- A collection of reference environments for offline reinforcement learning☆1,656Nov 18, 2024Updated last year
- Offline Reinforcement Learning (aka Batch Reinforcement Learning) on Atari 2600 games☆560Jun 26, 2023Updated 2 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆393Dec 18, 2021Updated 4 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆532Nov 22, 2022Updated 3 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆55Jul 26, 2019Updated 6 years ago
- Collection of reinforcement learning algorithms☆2,868Jun 17, 2024Updated last year
- [ICML 2022] The official implementation of DWBC in "Discriminator-Weighted Offline Imitation Learning from Suboptimal Demonstrations"☆35Jan 5, 2023Updated 3 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆599Oct 28, 2020Updated 5 years ago
- Meta-Inverse Reinforcement Learning with Probabilistic Context Variables☆76Mar 16, 2023Updated 2 years ago
- SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning☆130Mar 21, 2021Updated 4 years ago
- [ICLR 2022 Spotlight] Code for Reinforcement Learning with Sparse Rewards using Guidance from Offline Demonstration☆28Feb 10, 2022Updated 4 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Implementation prototype of the Deep Deterministic Off-Policy Gradient (DD-OPG) method.☆11Jun 12, 2019Updated 6 years ago
- Real-World RL Benchmark Suite☆363Aug 11, 2020Updated 5 years ago
- Efficient Exploration via State Marginal Matching (2019)☆69Jun 30, 2019Updated 6 years ago
- [ICML 2019] TensorFlow Code for Self-Supervised Exploration via Disagreement☆129Jun 11, 2019Updated 6 years ago
- Stochastic Latent Actor-Critic: Deep Reinforcement Learning with a Latent Variable Model☆154Oct 26, 2020Updated 5 years ago
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,052May 23, 2024Updated last year
- Reinforcement Learning with Deep Energy-Based Policies☆436Nov 28, 2023Updated 2 years ago
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,034Jul 14, 2023Updated 2 years ago
- Pytorch implementation of BEAR in "Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction"☆11Oct 29, 2019Updated 6 years ago
- ☆26Mar 16, 2023Updated 2 years ago
- Softlearning is a reinforcement learning framework for training maximum entropy policies in continuous domains. Includes the official imp…☆1,405Nov 29, 2023Updated 2 years ago
- PyTorch implementation of Distribution Correction(DisCor) based on Soft Actor-Critic.☆38Jun 22, 2022Updated 3 years ago
- Implementation of the skill discovery algorithm described in ICLR submission "Option Discovery using Deep Skill Chaining"☆30Sep 24, 2019Updated 6 years ago
- Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees☆93Sep 13, 2019Updated 6 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆25Jun 20, 2021Updated 4 years ago
- ☆99Mar 24, 2023Updated 2 years ago
- An offline deep reinforcement learning library☆1,645Sep 10, 2025Updated 5 months ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆227May 19, 2024Updated last year
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- ☆398Jul 18, 2019Updated 6 years ago