Div-Infinity/XQL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Div-Infinity/XQL)

Div-Infinity / XQL

Extreme Q-Learning: Max Entropy RL without Entropy

☆88

Alternatives and similar repositories for XQL

Users that are interested in XQL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
YiqinYang / VEM
View on GitHub
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15Mar 9, 2022Updated 4 years ago
Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
d5rlbenchmark / d5rl
View on GitHub
☆31Oct 3, 2023Updated 2 years ago
LAMDA-RL / PRDC
View on GitHub
Author's PyTorch implementation of ICML'23 paper "Policy Regularization with Dataset Constraint for Offline Reinforcement Learning" for D…
☆18Nov 8, 2024Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
sail-sg / OPER
View on GitHub
code for the paper Offline Prioritized Experience Replay
☆12Jun 13, 2023Updated 3 years ago
shidilrzf / Anti-exploration-RL
View on GitHub
Anti exploration in offline reinforcement learning
☆11May 17, 2021Updated 5 years ago
mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
enjeeneer / zero-shot-rl
View on GitHub
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆29Jan 14, 2025Updated last year
ikostrikov / dmcgym
View on GitHub
☆23Aug 19, 2022Updated 3 years ago
danijar / crafter-baselines
View on GitHub
Docker containers of baseline agents for the Crafter environment
☆30Dec 14, 2021Updated 4 years ago
OffDynamicsRL / off-dynamics-rl
View on GitHub
☆65Jan 30, 2026Updated 5 months ago
sfujim / TD7
View on GitHub
Author's PyTorch implementation of TD7 for online and offline RL
☆169Sep 12, 2023Updated 2 years ago
philippe-eecs / IDQL
View on GitHub
Repo for Implicit Diffusion Q-Learning
☆126Dec 5, 2023Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
microsoft / lightATAC
View on GitHub
A lightweight reimplementation of Adversarially Trained Actor Critic
☆19Mar 19, 2026Updated 4 months ago
conglu1997 / v-d4rl
View on GitHub
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115Apr 16, 2026Updated 3 months ago
dmksjfl / MCQ
View on GitHub
Code for Mildly Conservative Q-learning for Offline Reinforcement Learning (NeurIPS 2022)
☆64Apr 29, 2024Updated 2 years ago
holarissun / RewardShifting
View on GitHub
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29Oct 29, 2023Updated 2 years ago
Baichenjia / PBRL
View on GitHub
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29Feb 21, 2022Updated 4 years ago
hari-sikchi / DVL
View on GitHub
A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning
☆16Oct 22, 2023Updated 2 years ago
tinkoff-ai / sac-rnd
View on GitHub
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58Feb 3, 2023Updated 3 years ago
LAMDA-RL / OfflineRL-Lib
View on GitHub
Benchmarked implementations of Offline RL Algorithms.
☆77Mar 4, 2025Updated last year
tung-nd / cwbc
View on GitHub
☆11Oct 3, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
kenjyoung / dreamerv2_JAX
View on GitHub
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18Jan 16, 2023Updated 3 years ago
ikostrikov / rlpd
View on GitHub
☆409Feb 13, 2023Updated 3 years ago
facebookresearch / MRQ
View on GitHub
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆153Apr 7, 2026Updated 3 months ago
arnavkj1995 / VSG
View on GitHub
Learning Robust Dynamics Through Variational Sparse Gating
☆20Oct 19, 2022Updated 3 years ago
zhihanyang2022 / off-policy-continuous-control
View on GitHub
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆93Nov 21, 2023Updated 2 years ago
tinkoff-ai / CORL
View on GitHub
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆1,367Aug 3, 2023Updated 2 years ago
zwfightzw / Meta-Critic
View on GitHub
☆11Oct 19, 2020Updated 5 years ago
ltlhuuu / A2PR
View on GitHub
[ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…
☆34May 31, 2024Updated 2 years ago
ikostrikov / implicit_q_learning
View on GitHub
☆330Jan 23, 2022Updated 4 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
uber-research / D3G
View on GitHub
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32Feb 21, 2020Updated 6 years ago
pcchenxi / LAPO-offlienRL
View on GitHub
☆16Apr 14, 2026Updated 3 months ago
tinkoff-ai / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆63Aug 3, 2023Updated 2 years ago
quasimetric-learning / quasimetric-rl
View on GitHub
Open source code for paper "Optimal Goal-Reaching Reinforcement Learning via Quasimetric Learning" ICML 2023
☆61May 19, 2025Updated last year
csmile-1006 / PreferenceTransformer
View on GitHub
Preference Transformer: Modeling Human Preferences using Transformers for RL (ICLR2023 Accepted)
☆168Oct 15, 2023Updated 2 years ago
jon--lee / decision-pretrained-transformer
View on GitHub
Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…
☆79May 28, 2024Updated 2 years ago
typoverflow / WiseRL
View on GitHub
PyTorch implementations for Offline Preference-Based RL (PbRL) algorithms
☆21Mar 24, 2025Updated last year