Author's PyTorch implementation of Randomized Ensembled Double Q-Learning (REDQ) algorithm.
☆182 · Nov 14, 2024 · Updated last year
Alternatives and similar repositories for REDQ
Users who are interested in REDQ are comparing it to the libraries listed below.
- PyTorch implementation of Randomized Ensembled Double Q-learning (REDQ) ☆21 · Mar 12, 2021 · Updated 5 years ago
- Source files to replicate experiments in my ICLR 2022 paper. ☆73 · Jul 17, 2025 · Updated 10 months ago
- Guide on how to set up OpenAI Gym and MuJoCo for deep reinforcement learning research. ☆16 · Jan 12, 2021 · Updated 5 years ago
- ☆18 · Jul 13, 2022 · Updated 3 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL ☆408 · Dec 18, 2021 · Updated 4 years ago
- Explorer is a PyTorch reinforcement learning framework for exploring new ideas. ☆98 · Jun 19, 2025 · Updated 11 months ago
- Code for the paper "When to Trust Your Model: Model-Based Policy Optimization" ☆544 · Nov 22, 2022 · Updated 3 years ago
- [ICLR 22] Value Gradient weighted Model-Based Reinforcement Learning. ☆25 · Apr 15, 2023 · Updated 3 years ago
- ICRL 2020 ☆20 · Feb 18, 2020 · Updated 6 years ago
- Code for MOPO: Model-based Offline Policy Optimization ☆190 · May 17, 2022 · Updated 4 years ago
- Learning Action-Value Gradients in Model-based Policy Optimization ☆32 · Sep 7, 2021 · Updated 4 years ago
- Author's PyTorch implementation of BCQ for continuous and discrete actions ☆663 · Apr 6, 2021 · Updated 5 years ago
- 🔍 Codebase for the ICML '20 paper "Ready Policy One: World Building Through Active Learning" (arxiv: 2002.02693) ☆18 · Jul 6, 2023 · Updated 2 years ago
- ☆60 · Feb 3, 2023 · Updated 3 years ago
- A PyTorch replication of the model-based reinforcement learning algorithm MBPO ☆188 · Apr 12, 2022 · Updated 4 years ago
- MVE: model-based value estimation ☆11 · Jul 30, 2018 · Updated 7 years ago
- Codebase of Deployment-Efficient Reinforcement Learning via Model-Based Offline Optimization (ICLR2021) ☆54 · Jul 7, 2021 · Updated 4 years ago
- Code to accompany the paper "The Information Geometry of Unsupervised Reinforcement Learning" ☆20 · Oct 6, 2021 · Updated 4 years ago
- Unofficial PyTorch code for "Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models" ☆197 · Dec 8, 2022 · Updated 3 years ago
- papers about reinforcement learning ☆13 · Jan 4, 2021 · Updated 5 years ago
- OpenAI Gym wrapper for the DeepMind Control Suite ☆229 · May 19, 2024 · Updated 2 years ago
- Code for conservative Q-learning ☆484 · Dec 7, 2021 · Updated 4 years ago
- Easy MDPs and grid worlds with accessible transition dynamics to do exact calculations ☆49 · Apr 1, 2022 · Updated 4 years ago
- ExORL: Exploratory Data for Offline Reinforcement Learning ☆131 · Feb 8, 2022 · Updated 4 years ago
- Policy Expansion for Bridging Offline-to-Online Reinforcement Learning (ICLR23) ☆64 · Apr 4, 2023 · Updated 3 years ago
- Synthetic Experience Replay ☆112 · Apr 16, 2026 · Updated last month
- Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity" ☆94 · Jun 4, 2024 · Updated last year
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331) ☆25 · Jun 20, 2021 · Updated 4 years ago
- ☆24 · Jan 26, 2024 · Updated 2 years ago
- Proto-RL: Reinforcement Learning with Prototypical Representations ☆86 · Jun 12, 2022 · Updated 3 years ago
- ☆399 · Feb 13, 2023 · Updated 3 years ago
- Code for "Temporal Difference Learning for Model Predictive Control" ☆513 · Nov 25, 2023 · Updated 2 years ago
- Estimating Q(s,s') with Deep Deterministic Dynamics Gradients ☆32 · Feb 21, 2020 · Updated 6 years ago
- ☆128 · Feb 25, 2025 · Updated last year
- Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?" ☆23 · Sep 7, 2025 · Updated 8 months ago