Dragon-Zhuang/Reinformer

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/Dragon-Zhuang/Reinformer)

Dragon-Zhuang / Reinformer

Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL

☆49

Alternatives and similar repositories for Reinformer

Users that are interested in Reinformer are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

charleshsc / QT
View on GitHub
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
☆38Dec 30, 2024Updated last year
kristery / Elastic-DT
View on GitHub
[NeurIPS 2023] Implementation of Elastic Decision Transformer
☆40Oct 12, 2023Updated 2 years ago
KaiYan289 / RL_as_Vitamin_for_Online_Decision_Transformers
View on GitHub
☆16Dec 5, 2024Updated last year
RajGhugare19 / stitching-is-combinatorial-generalisation
View on GitHub
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆25Apr 19, 2024Updated 2 years ago
adityab / CrossQ
View on GitHub
Official code release for "CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity"
☆95Jun 4, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
enjeeneer / zero-shot-rl
View on GitHub
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆29Jan 14, 2025Updated last year
Improbable-AI / harness-offline-rl
View on GitHub
Official implementation of Harnessing Mixed Offline Reinforcement Learning Datasets via Trajectory Reweighting
☆16Feb 14, 2024Updated 2 years ago
keraJLi / synthetic-gymnax
View on GitHub
Drop-in environment replacements that make your RL algorithm train faster.
☆22Jun 19, 2024Updated 2 years ago
corl-team / headless-ad
View on GitHub
Official Implementation for "In-Context Reinforcement Learning for Variable Action Spaces"
☆92Feb 11, 2024Updated 2 years ago
UT-Austin-RPL / amago
View on GitHub
off-policy RL on long sequences
☆169May 29, 2026Updated 2 months ago
metadriverse / pvp
View on GitHub
Official release for the code used in paper: Learning from Active Human Involvement through Proxy Value Propagation (NeurIPS 2023 Spotlig…
☆34Jan 16, 2025Updated last year
Dragon-Zhuang / BPPO
View on GitHub
Author's Pytorch implementation of ICLR2023 paper Behavior Proximal Policy Optimization (BPPO).
☆95Dec 13, 2023Updated 2 years ago
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
ltlhuuu / A2PR
View on GitHub
[ICML 2024] The offical implementation of A2PR, a simple way to achieve SOTA in offline reinforcement learning with an adaptive advantage…
☆34May 31, 2024Updated 2 years ago
google-deepmind / dmc_vision_benchmark
View on GitHub
☆34Jun 21, 2024Updated 2 years ago
ec2604 / ContraBAR
View on GitHub
☆13May 21, 2023Updated 3 years ago
machinestein / Zero-Shot-Off-Policy-Learning
View on GitHub
Official Pytorch Implementation of "Zero-Shot Off-Policy Learning" (ICML 2026)
☆25Feb 16, 2026Updated 5 months ago
tinkoff-ai / CORL
View on GitHub
High-quality single-file implementations of SOTA Offline and Offline-to-Online RL algorithms: AWAC, BC, CQL, DT, EDAC, IQL, SAC-N, TD3+BC…
☆1,370Aug 3, 2023Updated 2 years ago
chongyi-zheng / td_infonce
View on GitHub
Implementations of Temporal Difference InfoNCE (TD InfoNCE)
☆35Nov 13, 2023Updated 2 years ago
dunnolab / vintix
View on GitHub
Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025
☆51May 23, 2025Updated last year
CEC-Agent / CEC
View on GitHub
Official Implementation of NeurIPS'23 Paper "Cross-Episodic Curriculum for Transformer Agents"
☆32Oct 12, 2023Updated 2 years ago
robfiras / s2pg
View on GitHub
Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"
☆25May 5, 2024Updated 2 years ago
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
beanie00 / Decision-ConvFormer
View on GitHub
[ICLR 2024 Spotlight] Code for the paper "Decision ConvFormer: Local Filtering in MetaFormer is Sufficient for Decision Making"
☆12Apr 22, 2024Updated 2 years ago
Improbable-AI / dw-offline-rl
View on GitHub
Official implementation of NeurIPS'23 paper, Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets
☆25Jan 29, 2024Updated 2 years ago
junsu-kim97 / PIG
View on GitHub
PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).
☆21Mar 4, 2023Updated 3 years ago
scottemmons / rvs
View on GitHub
Reinforcement Learning via Supervised Learning
☆72May 16, 2022Updated 4 years ago
jsikyoon / OCRL
View on GitHub
Object-Centric-Representation Library (OCRL): This repo is to explore OCR on various downstream tasks from supervised learning tasks to R…
☆12Feb 23, 2024Updated 2 years ago
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
aielawady / relic
View on GitHub
☆12Sep 7, 2024Updated last year
SageCao1125 / MambaDM
View on GitHub
☆12Jul 8, 2024Updated 2 years ago
quanser / ACC-Competition-2025
View on GitHub
☆11Apr 17, 2026Updated 3 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
AlexGoldie / learn-rl-algorithms
View on GitHub
Official implementation for "How Should We Meta-Learn Reinforcement Learning Algorithms?"
☆23Sep 7, 2025Updated 10 months ago
42jaylonw / shifu
View on GitHub
Lightweight Isaac Gym Environment Builder
☆40Nov 30, 2022Updated 3 years ago
brownirl / lambda_discrepancy
View on GitHub
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆24Oct 28, 2024Updated last year
Lei-Kun / Uni-O4
View on GitHub
Author's Pytorch implementation of our ICLR 2024 paper "Uni-O4"
☆82Jan 15, 2025Updated last year
junsu-kim97 / HIGL
View on GitHub
PyTorch code accompanying the paper "Landmark-Guided Subgoal Generation in Hierarchical Reinforcement Learning" (NeurIPS 2021).
☆32Oct 27, 2021Updated 4 years ago
EmptyJackson / policy-guided-diffusion
View on GitHub
Official implementation of the RLC 2024 paper "Policy-Guided Diffusion"
☆153Jul 19, 2024Updated 2 years ago
seohongpark / fql
View on GitHub
The official implementation of flow Q-learning (FQL)
☆321Jul 21, 2025Updated last year