young-geng/JaxCQL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/young-geng/JaxCQL)

young-geng / JaxCQL

Conservative Q learning in Jax

☆58

Alternatives and similar repositories for JaxCQL

Users that are interested in JaxCQL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

young-geng / CQL
View on GitHub
Conservative Q Learning on top of SAC
☆141Oct 15, 2022Updated 3 years ago
nakamotoo / Cal-QL
View on GitHub
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆124Jul 31, 2024Updated last year
ryanxhr / IVR
View on GitHub
[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…
☆46Jul 27, 2023Updated 2 years ago
ikostrikov / implicit_q_learning
View on GitHub
☆330Jan 23, 2022Updated 4 years ago
young-geng / tpu_pod_commander
View on GitHub
TPU pod commander is a package for managing and launching jobs on Google Cloud TPU pods.
☆20Sep 24, 2025Updated 10 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
seohongpark / HIQL
View on GitHub
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆98Dec 1, 2024Updated last year
ikostrikov / jaxrl2
View on GitHub
☆58Jan 20, 2023Updated 3 years ago
brentyi / transformer-exercises-jax
View on GitHub
☆18Apr 17, 2026Updated 3 months ago
microsoft / ATAC
View on GitHub
Code accompanying the paper Adversarially Trained Actor Critic for Offline Reinforcement Learning by Ching-An Cheng*, Tengyang Xie*, Nan …
☆74Feb 2, 2023Updated 3 years ago
ikostrikov / dmcgym
View on GitHub
☆23Aug 19, 2022Updated 3 years ago
snu-mllab / EDAC
View on GitHub
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
☆80Aug 14, 2022Updated 3 years ago
polixir / morec
View on GitHub
☆10Mar 11, 2024Updated 2 years ago
philippe-eecs / IDQL
View on GitHub
Repo for Implicit Diffusion Q-Learning
☆126Dec 5, 2023Updated 2 years ago
conglu1997 / v-d4rl
View on GitHub
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115Apr 16, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
Asap7772 / PTR
View on GitHub
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…
☆32Oct 26, 2022Updated 3 years ago
zzmtsvv / ORL
View on GitHub
☆58Feb 8, 2025Updated last year
maoyixiu / SCAS
View on GitHub
[NeurIPS 2024] Offline Reinforcement Learning with OOD State Correction and OOD Action Suppression
☆15Oct 29, 2025Updated 8 months ago
ethanluoyc / corax
View on GitHub
Corax: Core RL in JAX
☆41Feb 22, 2024Updated 2 years ago
sfujim / TD3_BC
View on GitHub
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
☆410Dec 18, 2021Updated 4 years ago
maoyixiu / DMG
View on GitHub
[NeurIPS 2024] Doubly Mild Generalization for Offline Reinforcement Learning
☆17Oct 29, 2025Updated 8 months ago
ikostrikov / jaxrl
View on GitHub
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
☆757Oct 26, 2022Updated 3 years ago
conglu1997 / SynthER
View on GitHub
Synthetic Experience Replay
☆114Apr 16, 2026Updated 3 months ago
Facebear-ljx / DOGE
View on GitHub
The official implementation of "When Data Geometry Meets Deep Function: Generalizing Offline Reinforcement Learning" (ICLR2023)
☆44Mar 6, 2023Updated 3 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
LAMDA-RL / ACT
View on GitHub
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17Feb 10, 2024Updated 2 years ago
pcchenxi / LAPO-offlienRL
View on GitHub
☆16Apr 14, 2026Updated 3 months ago
yihaosun1124 / OfflineRL-Kit
View on GitHub
An elegant PyTorch offline reinforcement learning library for researchers.
☆393May 2, 2026Updated 2 months ago
yihaosun1124 / mobile
View on GitHub
Code for MOBILE: Model-Bellman Inconsistency Penalized Offline Policy Optimization
☆22Apr 17, 2024Updated 2 years ago
twitter / diffusion-rl
View on GitHub
☆80Dec 9, 2022Updated 3 years ago
sharkwyf / critic-guided-decision-transformer
View on GitHub
[AAAI'2024] Critic-Guided Decision Transformer for Offline Reinforcement Learning
☆18May 21, 2025Updated last year
dibyaghosh / icvf_release
View on GitHub
Public code for "Reinforcement Learning from Passive Data via Latent Intentions"
☆89Nov 19, 2023Updated 2 years ago
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
Wenxuan-Zhou / PLAS
View on GitHub
Code for Latent Action Space for Offline Reinforcement Learning [CoRL 2020]
☆54Oct 18, 2021Updated 4 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
facebookresearch / ExPLORe
View on GitHub
This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".
☆26Dec 5, 2023Updated 2 years ago
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
facebookresearch / gen_dgrl
View on GitHub
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆29Apr 8, 2026Updated 3 months ago
charleshsc / QT
View on GitHub
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
☆38Dec 30, 2024Updated last year
young-geng / SimpleSAC
View on GitHub
A simple and easy to use implementation of the soft actor-critic algorithm.
☆15Sep 2, 2022Updated 3 years ago
gwthomas / IQL-PyTorch
View on GitHub
A PyTorch implementation of Implicit Q-Learning
☆99Oct 23, 2021Updated 4 years ago
ryanxhr / POR
View on GitHub
[NeurIPS 2022 Oral] The official implementation of POR in "A Policy-Guided Imitation Approach for Offline Reinforcement Learning"
☆58Apr 6, 2023Updated 3 years ago