Toshihiro-Ota / decision-mamba
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces
☆38Updated 11 months ago
Alternatives and similar repositories for decision-mamba:
Users that are interested in decision-mamba are comparing it to the libraries listed below
- Meta-RL Model-Based Algorithm☆29Updated 9 months ago
- ☆74Updated last week
- PWM: Policy Learning with Large World Models☆42Updated last week
- When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)☆60Updated last year
- HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)☆76Updated 3 months ago
- [ICML 2023] Pre-train world model-based agents with different unsupervised strategies, fine-tune the agent's components selectively, and …☆37Updated last year
- ☆21Updated 4 months ago
- Learning to Modulate pre-trained Models in RL (Decision Transformer, LoRA, Fine-tuning)☆54Updated 4 months ago
- Code release for "HarmonyDream: Task Harmonization Inside World Models" (ICML 2024), https://arxiv.org/abs/2310.00344☆31Updated 8 months ago
- Unofficial PyTorch implementation (replicating paper results) of Implicit Q-Learning (In-sample Q-Learning) for offline RL☆23Updated 3 months ago
- ☆77Updated 8 months ago
- Pytorch Implementation of Learning Latent Dynamic Robust Representations for World Models☆16Updated 9 months ago
- Controllability-Aware Unsupervised Skill Discovery (ICML 2023)☆23Updated last year
- Official implementation of NeurIPS'23 paper, Beyond Uniform Sampling: Offline Reinforcement Learning with Imbalanced Datasets☆24Updated last year
- Pre-Training Goal-based Models for Sample-Efficient Reinforcement Learning☆25Updated last year
- Repo for Implicit Diffusion Q-Learning☆104Updated last year
- Official codebase for "Privileged Sensing Scaffolds Reinforcement Learning", contains the Scaffolder algorithm and Sensory Scaffolding Su…☆24Updated 10 months ago
- [NeurIPS 2023] Implementation of Elastic Decision Transformer☆34Updated last year
- Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization☆25Updated last year
- ☆39Updated 3 months ago
- Transformer-based World Models☆76Updated last year
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆97Updated last year
- [ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Reg…☆44Updated last year
- Synthetic Experience Replay☆86Updated 9 months ago
- JAX implementation of WSRL and RL baselines | ICLR 2025☆30Updated last month
- Resilient Model-Based RL by Regularizing Posterior Predictability☆16Updated 11 months ago
- ☆36Updated 2 months ago
- Foundation Policies with Hilbert Representations (ICML 2024)☆76Updated 10 months ago
- ☆58Updated 3 months ago