When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆70Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for Memory-RL
Users that are interested in Memory-RL are comparing it to the libraries listed below
Sorting:
- ☆10Jun 27, 2024Updated last year
- Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024☆24Apr 7, 2024Updated last year
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆342Aug 22, 2024Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Apr 19, 2024Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Nov 21, 2023Updated 2 years ago
- ☆19Apr 22, 2024Updated last year
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Jan 27, 2026Updated last month
- ☆19Jun 25, 2023Updated 2 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- ☆13May 21, 2023Updated 2 years ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆23Jan 5, 2024Updated 2 years ago
- Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆78May 28, 2024Updated last year
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆240Nov 23, 2025Updated 3 months ago
- ☆12Sep 7, 2024Updated last year
- A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning☆15Oct 22, 2023Updated 2 years ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆28Aug 19, 2025Updated 6 months ago
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Jan 7, 2026Updated last month
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆86Apr 4, 2025Updated 10 months ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆25Jan 14, 2025Updated last year
- UAV offloading based on QMIX☆15Oct 12, 2023Updated 2 years ago
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Feb 20, 2026Updated last week
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆171Jul 7, 2024Updated last year
- Benchmarking RL generalization in an interpretable way.☆175Nov 20, 2025Updated 3 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆86Oct 15, 2023Updated 2 years ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆63Jan 2, 2026Updated 2 months ago
- MR.Q is a general-purpose model-free reinforcement learning algorithm.☆143Jun 23, 2025Updated 8 months ago
- ☆19Mar 1, 2023Updated 3 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆112Dec 5, 2023Updated 2 years ago
- Scripts to recreate the D4RL datasets with Minari☆25Jul 21, 2025Updated 7 months ago
- A dataloader, but for JAX☆20May 17, 2024Updated last year
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆22Nov 29, 2025Updated 3 months ago
- Learning Robust Dynamics Through Variational Sparse Gating☆20Oct 19, 2022Updated 3 years ago
- Code of the Paper "Time-Efficient Reinforcement Learning with Stochastic Stateful Policies"☆25May 5, 2024Updated last year