BY571/Implicit-Q-Learning

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/BY571/Implicit-Q-Learning)

BY571 / Implicit-Q-Learning

PyTorch implementation of the implicit Q-learning algorithm (IQL)

☆44

Alternatives and similar repositories for Implicit-Q-Learning

Users that are interested in Implicit-Q-Learning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

gwthomas / IQL-PyTorch
View on GitHub
A PyTorch implementation of Implicit Q-Learning
☆99Oct 23, 2021Updated 4 years ago
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
Manchery / iql-pytorch
View on GitHub
Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL
☆24Nov 4, 2024Updated last year
google-deepmind / constrained_optidice
View on GitHub
☆10Sep 9, 2022Updated 3 years ago
albertwilcox / mcac
View on GitHub
Author implementation of Monte Carlo Augmented Actor Critic in PyTorch
☆18Oct 24, 2022Updated 3 years ago
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
sparkmxy / my-offlinerl
View on GitHub
☆26Jun 14, 2022Updated 4 years ago
vikashplus / unitree_sim
View on GitHub
MuJoCo models for Unitree Robots
☆12Nov 24, 2021Updated 4 years ago
ikostrikov / implicit_q_learning
View on GitHub
☆330Jan 23, 2022Updated 4 years ago
ReinholdM / Papers-of-Offline-RL
View on GitHub
Related papers for offline reforcement learning (we mainly focus on representation and sequence modeling and conventional offline RL)
☆18Apr 21, 2022Updated 4 years ago
young-geng / CQL
View on GitHub
Conservative Q Learning on top of SAC
☆140Oct 15, 2022Updated 3 years ago
sfujim / TD3_BC
View on GitHub
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆410Dec 18, 2021Updated 4 years ago
Egiob / DiversityIsAllYouNeed-SB3
View on GitHub
Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.
☆13Jul 11, 2022Updated 4 years ago
ikostrikov / dmcgym
View on GitHub
☆23Aug 19, 2022Updated 3 years ago
hari-sikchi / offline_rl
View on GitHub
Pytorch implementation of state-of-the-art offline reinforcement learning algorithms.
☆23Aug 27, 2022Updated 3 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
YangRui2015 / RIQL
View on GitHub
[ICLR 2024 Spotlight] Code for ICLR 2024 paper "Towards Robust Offline Reinforcement Learning under Diverse Data Corruption"
☆22Nov 25, 2024Updated last year
Farama-Foundation / minari-dataset-generation-scripts
View on GitHub
Scripts to recreate the D4RL datasets with Minari
☆26Jul 4, 2026Updated 2 weeks ago
brentyi / transformer-exercises-jax
View on GitHub
☆18Apr 17, 2026Updated 3 months ago
conglu1997 / v-d4rl
View on GitHub
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115Apr 16, 2026Updated 3 months ago
maxreciprocate / offline
View on GitHub
Offline RL experiments
☆15Oct 1, 2022Updated 3 years ago
takuseno / d4rl-pybullet
View on GitHub
Datasets for data-driven deep reinforcement learning with PyBullet environments
☆152Mar 19, 2021Updated 5 years ago
joonaspu / video-game-behavioural-cloning
View on GitHub
Behavioural cloning experiments with video games
☆32Apr 15, 2020Updated 6 years ago
hari-sikchi / AWAC
View on GitHub
Advantage weighted Actor Critic for Offline RL
☆53Aug 27, 2022Updated 3 years ago
YangRui2015 / AWGCSL
View on GitHub
Code for ICLR 2022 paper Rethinking Goal-Conditioned Supervised Learning and Its Connection to Offline RL.
☆27Feb 21, 2022Updated 4 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
joeybose / FloRL
View on GitHub
Implicit Normalizing Flows + Reinforcement Learning
☆62May 31, 2019Updated 7 years ago
YiqinYang / VEM
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15Mar 9, 2022Updated 4 years ago
denisyarats / exorl
View on GitHub
ExORL: Exploratory Data for Offline Reinforcement Learning
☆137Feb 8, 2022Updated 4 years ago
HxLyn3 / MPPVE
View on GitHub
☆10Sep 19, 2023Updated 2 years ago
yobibyte / amorpheus
View on GitHub
My Body Is A Cage
☆41Apr 13, 2021Updated 5 years ago
sa-and / MCD
View on GitHub
☆12Mar 21, 2024Updated 2 years ago
DHDev0 / Muzero-unplugged
View on GitHub
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆36Jun 25, 2025Updated last year
young-geng / SimpleSAC
View on GitHub
A simple and easy to use implementation of the soft actor-critic algorithm.
☆15Sep 2, 2022Updated 3 years ago
amazon-science / causal-self-compatibility
View on GitHub
Code to reproduce the experiments from the paper "Self-Compatibility: Evaluating Causal Discovery without Ground Truth"
☆12Mar 9, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
hcmlab / GANterfactual-RL
View on GitHub
Counterfactual explanations for Reinforcement Learning agents on Atari
☆12Apr 3, 2023Updated 3 years ago
2019ChenGong / Offline_RL_Poisoner
View on GitHub
[S&P 2024] Replication Package for "Mind Your Data! Hiding Backdoors in Offline Reinforcement Learning Datasets".
☆33Dec 30, 2024Updated last year
Junyoungpark / Pytorch-AWAC
View on GitHub
A PyTorch implementation of Advantage weighted Actor-Critic (AWAC)
☆56Mar 30, 2021Updated 5 years ago
jakegrigsby / deep_control
View on GitHub
Deep Reinforcement Learning for Continuous Control in PyTorch
☆106Dec 31, 2021Updated 4 years ago
pvili / SpikingTimeDependentPlasticity
View on GitHub
The code to simulate spiking neural networks as used in the paper "Spiking Time-Dependent Plasticity Leads to Efficient Coding of Predict…
☆10Nov 24, 2019Updated 6 years ago
tinkoff-ai / CORL
View on GitHub
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆1,367Aug 3, 2023Updated 2 years ago
Princeton-RL / normalising-flows-4-reinforcement-learning
View on GitHub
Code for the paper Normalizing Flows are Capable Models for RL
☆20Jun 3, 2025Updated last year