d5rlbenchmark/d5rl

☆31

Alternatives and similar repositories for d5rl

Users who are interested in d5rl are comparing it to the repositories listed below.

dmksjfl / SEABO
Official code for ICLR 2024 paper, SEABO: A Simple Search-Based Method for Offline Imitation Learning
☆12 · Jan 19, 2024 · Updated 2 years ago
Lifelong-ML / offline-compositional-rl-datasets
☆21 · Mar 19, 2024 · Updated 2 years ago
FelipeNuti / diffusion-relative-rewards
Codebase for Extracting Reward Functions from Diffusion Models
☆16 · Dec 7, 2023 · Updated 2 years ago
Div-Infinity / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆88 · Feb 14, 2023 · Updated 3 years ago
jianlanluo / SAQ
☆34 · Jun 9, 2025 · Updated last year
SonyResearch / simba
☆128 · Feb 25, 2025 · Updated last year
TianyuCodings / Diffusion_Trusted_Q_Learning
[NeurIPS 2024 DTQL] Diffusion Trusted Q-Learning for Offline RL - Official PyTorch Implementation
☆27 · May 31, 2024 · Updated 2 years ago
martius-lab / GateL0RD-paper
Code for the paper: Sparsely Changing Latent States for Prediction and Planning in Partially Observable Domains
☆11 · Nov 12, 2021 · Updated 4 years ago
LAMDA-RL / ACT
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17 · Feb 10, 2024 · Updated 2 years ago
OffDynamicsRL / off-dynamics-rl
☆65 · Jan 30, 2026 · Updated 5 months ago
0xWelt / BibTeX-Formatter
Format your bibtex (.bib) file to help standardize citations for conference and journal submissions
☆14 · Nov 23, 2025 · Updated 7 months ago
sail-sg / offbench
☆16 · Jun 1, 2023 · Updated 3 years ago
HauffQian / DGAP
☆14 · May 13, 2025 · Updated last year
yueyang130 / SEEM
Official code of paper Understanding, Predicting and Better Resolving Q-Value Divergence in Offline-RL
☆24 · Oct 30, 2023 · Updated 2 years ago
conglu1997 / v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115 · Apr 16, 2026 · Updated 3 months ago
sail-sg / edp
[NeurIPS 2023] Efficient Diffusion Policy
☆113 · Oct 31, 2023 · Updated 2 years ago
kschweig / OfflineRL
Experiment for Understanding the Effects of Dataset Characteristics on Offline Reinforcement Learning
☆26 · Jan 16, 2023 · Updated 3 years ago
uber-research / D3G
Estimating Q(s,s') with Deep Deterministic Dynamics Gradients
☆32 · Feb 21, 2020 · Updated 6 years ago
sail-sg / OPER
code for the paper Offline Prioritized Experience Replay
☆12 · Jun 13, 2023 · Updated 3 years ago
seohongpark / ogbench
A benchmark for offline goal-conditioned RL and offline RL
☆436 · Jan 14, 2026 · Updated 6 months ago
Baichenjia / PBRL
Pessimistic Bootstrapping for Uncertainty-Driven Offline Reinforcement Learning
☆29 · Feb 21, 2022 · Updated 4 years ago
ml-jku / OfflineRL
☆31 · Jan 16, 2023 · Updated 3 years ago
chenran-li / RQL-release
(NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value
☆35 · Mar 29, 2024 · Updated 2 years ago
vv19 / rendiff
☆28 · Aug 6, 2024 · Updated last year
ambujtewari / stats701-winter2021
Theory of Reinforcement Learning
☆18 · Apr 20, 2021 · Updated 5 years ago
snu-mllab / EDAC
Official PyTorch implementation of "Uncertainty-Based Offline Reinforcement Learning with Diversified Q-Ensemble" (NeurIPS'21)
☆80 · Aug 14, 2022 · Updated 3 years ago
Zhendong-Wang / Diffusion-Policies-for-Offline-RL
☆431 · Apr 29, 2024 · Updated 2 years ago
ALRhub / d3il
[ICLR 2024] Official implementation for "Towards Diverse Behaviors: A Benchmark for Imitation Learning with Human Demonstrations"
☆108 · Feb 17, 2025 · Updated last year
mansicer / Q-Adapter
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18 · Oct 5, 2024 · Updated last year
lafmdp / HIDIL
[NeurIPS'20] Code for the paper "Offline Imitation Learning with a Misspecified Simulator"
☆12 · Nov 24, 2021 · Updated 4 years ago
tesslerc / TD3-JAX
A JAX Implementation of the Twin Delayed DDPG Algorithm
☆35 · Mar 12, 2020 · Updated 6 years ago
LAMDA-RL / OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
☆77 · Mar 4, 2025 · Updated last year
clvrai / skill-chaining
Adversarial Skill Chaining for Long-Horizon Robot Manipulation via Terminal State Regularization (CoRL 2021)
☆38 · May 3, 2022 · Updated 4 years ago
steventango / jumpstart-rl
Implementation of Jump-Start Reinforcement Learning (JSRL) with Stable Baselines3
☆37 · Jan 12, 2024 · Updated 2 years ago
Improbable-AI / harness-offline-rl
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16 · Feb 14, 2024 · Updated 2 years ago
hari-sikchi / DVL
A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning
☆16 · Oct 22, 2023 · Updated 2 years ago
tinkoff-ai / sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58 · Feb 3, 2023 · Updated 3 years ago
Howuhh / sac-n-jax
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆56 · May 21, 2023 · Updated 3 years ago
dmksjfl / PAR
Official code for Cross-Domain Policy Adaptation by Capturing Representation Mismatch (ICML 2024)
☆15 · Aug 15, 2025 · Updated 11 months ago