Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆176Nov 14, 2024Updated last year
Alternatives and similar repositories for REDQ
Users that are interested in REDQ are comparing it to the libraries listed below
Sorting:
- Pytorch implementation of Randomized Ensembled Double Q-learning (REDQ)☆21Mar 12, 2021Updated 4 years ago
- Source files to replicate experiments in my ICLR 2022 paper.☆71Jul 17, 2025Updated 7 months ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆393Dec 18, 2021Updated 4 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning.☆25Apr 15, 2023Updated 2 years ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"☆532Nov 22, 2022Updated 3 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas.☆97Jun 19, 2025Updated 8 months ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693)☆18Jul 6, 2023Updated 2 years ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M …☆44Dec 11, 2021Updated 4 years ago
- Code for MOPO: Model-based Offline Policy Optimization☆191May 17, 2022Updated 3 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization☆32Sep 7, 2021Updated 4 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations☆49Apr 1, 2022Updated 3 years ago
- A pytorch reprelication of the model-based reinforcement learning algorithm MBPO☆185Apr 12, 2022Updated 3 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations☆86Jun 12, 2022Updated 3 years ago
- Guide on how to set up openai gym and mujoco for deep reinforcement learning research.☆16Jan 12, 2021Updated 5 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning"☆20Oct 6, 2021Updated 4 years ago
- MVE: model-based value estimation☆11Jul 30, 2018Updated 7 years ago
- ☆12Apr 25, 2022Updated 3 years ago
- Code for the paper "Gamma-Models: Generative Temporal Difference Learning for Infinite-Horizon Prediction"☆46Sep 20, 2023Updated 2 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions☆657Apr 6, 2021Updated 4 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite☆227May 19, 2024Updated last year
- ☆15Oct 20, 2020Updated 5 years ago
- Code for conservative Q-learning☆474Dec 7, 2021Updated 4 years ago
- Unofficial Pytorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆196Dec 8, 2022Updated 3 years ago
- Repository for the paper "Long-Horizon Visual Planning with Goal-Conditioned Hierarchical Predictors"☆46Nov 22, 2022Updated 3 years ago
- papers about reinforcement learning☆13Jan 4, 2021Updated 5 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331)☆25Jun 20, 2021Updated 4 years ago
- ☆18Jul 13, 2022Updated 3 years ago
- PyTorch implementation of Munchausen Reinforcement Learning based on DQN and SAC. Handles discrete and continuous action spaces☆15Oct 3, 2021Updated 4 years ago
- ☆120Feb 25, 2025Updated last year
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients☆32Feb 21, 2020Updated 6 years ago
- Co-Adaptation of Algorithmic and Implementational Innovations in Inference-based Deep Reinforcement Learning (NeurIPS2021)☆20Oct 25, 2021Updated 4 years ago
- ☆60Feb 3, 2023Updated 3 years ago
- Experiment code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models"☆473Jul 6, 2023Updated 2 years ago
- ☆28Jan 11, 2021Updated 5 years ago
- CURL: Contrastive Unsupervised Representation Learning for Sample-Efficient Reinforcement Learning☆599Oct 28, 2020Updated 5 years ago
- An index of algorithms for offline reinforcement learning (offline-rl)☆1,052May 23, 2024Updated last year
- Author's PyTorch implementation of TD3 for OpenAI gym tasks☆2,034Jul 14, 2023Updated 2 years ago
- Code for Stabilizing Off-Policy RL via Bootstrapping Error Reduction☆163Jul 17, 2020Updated 5 years ago
- ☆385Feb 13, 2023Updated 3 years ago