Toshihiro-Ota / decision-mambaView external linksLinks
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces
☆49Apr 1, 2024Updated last year
Alternatives and similar repositories for decision-mamba
Users that are interested in decision-mamba are comparing it to the libraries listed below
Sorting:
- ICML'2024: Q-value Regularized Transformer for Offline Reinforcement Learning☆36Dec 30, 2024Updated last year
- ☆14Sep 29, 2025Updated 4 months ago
- [ICLR 2024 oral] Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆29Mar 1, 2024Updated last year
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- ☆11May 27, 2022Updated 3 years ago
- ☆10Jun 27, 2024Updated last year
- ☆11Oct 3, 2022Updated 3 years ago
- Non-official implementation of paper "In-context Reinforcement Learning with Algorithm Distillation"☆11Aug 15, 2024Updated last year
- Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku☆12Aug 14, 2024Updated last year
- ☆16Jan 26, 2023Updated 3 years ago
- ☆14May 17, 2024Updated last year
- Official code for ACT: Empowering Decision Transformer with Dynamic Programming via Advantage Conditioning (AAAI'24)☆17Feb 10, 2024Updated 2 years ago
- code for the paper Offline Prioritized Experience Replay☆13Jun 13, 2023Updated 2 years ago
- M-CURL: Masked Contrastive Representation Learning for Reinforcement Learning☆29Nov 5, 2020Updated 5 years ago
- ☆14Dec 5, 2024Updated last year
- Official implementation for ICLR 2025 paper "Distilling Reinforcement Learning Algorithms for In-Context Model-Based Planning"☆20Mar 5, 2025Updated 11 months ago
- On-Policy Policy Gradient Algorithms in JAX☆42Jan 25, 2024Updated 2 years ago
- ☆19Aug 21, 2024Updated last year
- Codes accompanying the paper "Offline Reinforcement Learning with Value-Based Episodic Memory" (ICLR 2022 https://arxiv.org/abs/2110.0979…☆15Mar 9, 2022Updated 3 years ago
- Implemenation of the HIERarchical imagionation On Structured State Space Sequence Models (HIEROS) paper☆21Jul 14, 2024Updated last year
- Official code for ICML 2024 paper Reinformer: Max-Return Sequence Modeling for offline RL☆46Oct 16, 2024Updated last year
- Code for AAAI 2023 paper "Hypernetworks for Zero-shot Transfer in Reinforcement Learning"☆21Apr 26, 2023Updated 2 years ago
- Code release for H-GAP Humanoid Control with a Generalist Planner☆24Nov 25, 2024Updated last year
- Official codebase for CuGRO: Continual Offline Reinforcement Learning via Diffusion-based Dual Generative Replay☆33Apr 14, 2024Updated last year
- ☆23Mar 18, 2024Updated last year
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 2 years ago
- ☆86May 31, 2025Updated 8 months ago
- A Pytorch implemtentation of ICCV 2019 paper Face Swapping Gan (https://arxiv.org/abs/1908.05932)☆21Nov 11, 2019Updated 6 years ago
- Drop-in environment replacements that make your RL algorithm train faster.☆21Jun 19, 2024Updated last year
- Next-gen Foundation Model for Embodied AI☆25Nov 21, 2025Updated 2 months ago
- This project aims to use a combination of imitation learning and reinforcement learning in order to play Asseto Corsa by learning new pol…☆19Sep 10, 2020Updated 5 years ago
- Monte Carlo tree search in JAX, with functionality to continue search from a previous subtree☆26May 2, 2025Updated 9 months ago
- Q-learning with Adjoint Matching☆49Jan 31, 2026Updated 2 weeks ago
- speed-running solving robot manipulation tasks☆24Oct 31, 2024Updated last year
- standalone and "ros-free" python wrapper of voxblox (online SDF generator from point clouds)☆21Nov 7, 2022Updated 3 years ago
- 🤖 Elegant implementations of offline safe RL algorithms in PyTorch☆231Sep 13, 2024Updated last year
- Official codebase for GTA: Generative Trajectory Augmentation with Guidance for Offline Reinforcement Learning.☆29Nov 12, 2024Updated last year
- (NeurIPS '22) LISA: Learning Interpretable Skill Abstractions - A framework for unsupervised skill learning using Imitation☆29Feb 22, 2023Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆62Jan 2, 2026Updated last month