PyTorch implementation of the implicit Q-learning algorithm (IQL)
☆44Dec 17, 2021Updated 4 years ago
Alternatives and similar repositories for Implicit-Q-Learning
Users that are interested in Implicit-Q-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A PyTorch implementation of Implicit Q-Learning☆99Oct 23, 2021Updated 4 years ago
- Anti exploration in offline reinforcement learning☆11May 17, 2021Updated 5 years ago
- ☆10Sep 9, 2022Updated 3 years ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆24Nov 4, 2024Updated last year
- Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)☆18Apr 21, 2022Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Author implementation of Monte Carlo Augmented Actor Critic in PyTorch☆18Oct 24, 2022Updated 3 years ago
- Code for the paper Normalizing Flows are Capable Models for RL☆19Jun 3, 2025Updated 11 months ago
- ☆31Jan 16, 2023Updated 3 years ago
- ☆26Jun 14, 2022Updated 3 years ago
- [ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"☆22Nov 25, 2024Updated last year
- Pytorch implementation of state-of-the-art offline reinforcement learning algorithms.☆23Aug 27, 2022Updated 3 years ago
- ☆329Jan 23, 2022Updated 4 years ago
- Behavioural cloning experiments with video games☆32Apr 15, 2020Updated 6 years ago
- Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL☆408Dec 18, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆12Mar 21, 2024Updated 2 years ago
- Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.☆13Jul 11, 2022Updated 3 years ago
- Counterfactual explanations for Reinforcement Learning agents on Atari☆12Apr 3, 2023Updated 3 years ago
- [S&P 2024] Replication Package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets".☆33Dec 30, 2024Updated last year
- A PyTorch implementation of Advantage weighted Actor-Critic (AWAC)☆56Mar 30, 2021Updated 5 years ago
- Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…☆35Jun 25, 2025Updated 10 months ago
- Advantage weighted Actor Critic for Offline RL☆54Aug 27, 2022Updated 3 years ago
- The code to simulate spiking neural networks as used in the paper "Spiking Time-Dependent Plasticity Leads to Efficient Coding of Predict…☆10Nov 24, 2019Updated 6 years ago
- ☆10Sep 19, 2023Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- ☆10Oct 15, 2020Updated 5 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- ☆11Jun 5, 2023Updated 2 years ago
- Code to reproduce the experiments from the paper "Self-Compatibility: Evaluating Causal Discovery without Ground Truth"☆12Mar 9, 2024Updated 2 years ago
- [NeurIPS 2022] Leveraging Factored Action Spaces for Efficient Offline RL in Healthcare. https://arxiv.org/abs/2305.01738☆11Nov 27, 2022Updated 3 years ago
- ☆11Nov 8, 2022Updated 3 years ago
- Official implementation of HEAD CoRL 2025☆26Aug 22, 2025Updated 8 months ago
- [ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"☆12Apr 22, 2024Updated 2 years ago
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆38Dec 30, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- PyTorch implementation of the Offline Reinforcement Learning algorithm CQL. Includes the versions DQN-CQL and SAC-CQL for discrete and co…☆147May 6, 2024Updated 2 years ago
- Public examples for FORCES NLP☆12Jun 20, 2017Updated 8 years ago
- MATLAB framework for work with WEB services (supports OAuth 1.0/2.0)☆13Apr 16, 2021Updated 5 years ago
- Datasets for data-driven deep reinforcement learning with PyBullet environments☆152Mar 19, 2021Updated 5 years ago
- Application of Deep Reinforcement Learning to Supply Chain management. Reference: https://blog.griddynamics.com/deep-reinforcement-learni…☆12Jul 21, 2021Updated 4 years ago
- [NeurIPS 2023] Conformal Prediction for Uncertainty-Aware Planning with Diffusion Dynamics Model☆20Dec 9, 2023Updated 2 years ago
- A super-lightweight super-capable agentic tool with improved security versus OpenClaw.☆48Updated this week