Toshihiro-Ota/decision-mamba

Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces

☆53

Alternatives and similar repositories for decision-mamba

Users who are interested in decision-mamba are comparing it to the repositories listed below.

charleshsc / QT
ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning
☆38 · Dec 30, 2024 · Updated last year
aliang8 / varibad_jax
☆10 · Jun 27, 2024 · Updated 2 years ago
cinemere / ad-icrl
Non-official implementation of paper "In-context Reinforcement Learning with Algorithm Distillation"
☆13 · Aug 15, 2024 · Updated last year
DanieleGammelli / graph-rl-for-network-optimization
☆16 · Jan 26, 2023 · Updated 3 years ago
sail-sg / OPER
code for the paper Offline Prioritized Experience Replay
☆12 · Jun 13, 2023 · Updated 3 years ago
abhimasand / AI-Assetto-Corsa
This project aims to use a combination of imitation learning and reinforcement learning in order to play Assetto Corsa by learning new pol…
☆25 · Sep 10, 2020 · Updated 5 years ago
kwanyoungpark / MAC
Code for Scalable Offline Model-Based RL with Action chunking
☆30 · Feb 20, 2026 · Updated 5 months ago
tyler-ingebrand / FunctionEncoder
☆14 · Sep 29, 2025 · Updated 10 months ago
corl-team / ad-eps
Official Implementation for "In-Context Reinforcement Learning from Noise Distillation"
☆35 · Sep 18, 2024 · Updated last year
tung-nd / cwbc
☆11 · Oct 3, 2022 · Updated 3 years ago
zoharri / mamba
Meta-RL Model-Based Algorithm
☆46 · Apr 30, 2025 · Updated last year
scrambledpie / GPVAE
Train and visualise a latent variable model of moving objects.
☆16 · Apr 28, 2020 · Updated 6 years ago
jaehyeon-son / dicp
Official implementation for ICLR 2025 paper "Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning"
☆22 · Mar 5, 2025 · Updated last year
Algoryx / AGXUnreal
AGX Dynamics for Unreal plugin.
☆13 · Jul 3, 2026 · Updated 3 weeks ago
yun-kwak / decision-transformer-jax
Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku
☆13 · Aug 14, 2024 · Updated last year
LAMDA-RL / ACT
Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)
☆17 · Feb 10, 2024 · Updated 2 years ago
sfujim / SR-DICE
Author's PyTorch implementation of SR-DICE for marginalized importance sampling
☆28 · Dec 7, 2021 · Updated 4 years ago
facebookresearch / hgap
Code release for H-GAP Humanoid Control with a Generalist Planner
☆25 · Nov 25, 2024 · Updated last year
YiqinYang / VEM
Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…
☆15 · Mar 9, 2022 · Updated 4 years ago
GeWu-Lab / DepthHelps-IROS2024
☆19 · Aug 21, 2024 · Updated last year
sukhijab / maxinforl_jax
☆29 · Jan 8, 2026 · Updated 6 months ago
subho406 / Recurrent-PPO-Jax
Implementation of Proximal Policy Optimization in Jax+Flax
☆21 · May 18, 2023 · Updated 3 years ago
wenchiyang / pls
☆16 · May 17, 2024 · Updated 2 years ago
Matt00n / PolicyGradientsJax
On-Policy Policy Gradient Algorithms in JAX
☆44 · Jan 25, 2024 · Updated 2 years ago
liuzuxin / OSRL
🤖 Elegant implementations of offline safe RL algorithms in PyTorch
☆246 · Sep 13, 2024 · Updated last year
PKU-RL / PTGM
[ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning
☆30 · Mar 1, 2024 · Updated 2 years ago
SAIC-MONTREAL / hyperzero
Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"
☆24 · Apr 26, 2023 · Updated 3 years ago
Snagnar / Hieros
Implementation of the HIERarchical imagination On Structured State Space Sequence Models (HIEROS) paper
☆23 · Jul 14, 2024 · Updated 2 years ago
max7born / decision-lstm
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆28 · Mar 24, 2023 · Updated 3 years ago
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆117 · Dec 5, 2023 · Updated 2 years ago
rlatjddbs / SSD
☆19 · Jan 2, 2024 · Updated 2 years ago
ColinQiyangLi / dqc
Decoupled Q-Chunking
☆74 · May 3, 2026 · Updated 2 months ago
tinkoff-ai / sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58 · Feb 3, 2023 · Updated 3 years ago
tsinghua-fib-lab / DyPS
code for DyPS: Dynamic Parameter Sharing in Multi-Agent Reinforcement Learning for Spatio-Temporal Resource Allocation
☆29 · Oct 27, 2024 · Updated last year
airs-cuhk / airsoul
Next-gen Foundation Model for Embodied AI
☆32 · Apr 7, 2026 · Updated 3 months ago
FLAIROx / popjym
POPGym Library in JAX
☆14 · Apr 15, 2024 · Updated 2 years ago
mxu34 / prompt-dt
Official code repository for Prompt-DT.
☆123 · Aug 3, 2022 · Updated 3 years ago
lowrollr / mctx-az
Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree
☆27 · May 2, 2025 · Updated last year
kristery / Elastic-DT
[NeurIPS 2023] Implementation of Elastic Decision Transformer
☆40 · Oct 12, 2023 · Updated 2 years ago