When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆72Jan 18, 2024Updated 2 years ago
Alternatives and similar repositories for Memory-RL
Users who are interested in Memory-RL are comparing it to the libraries listed below
- Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024☆24Apr 7, 2024Updated last year
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆344Aug 22, 2024Updated last year
- ☆10Jun 27, 2024Updated last year
- Artifact for paper "Chronosymbolic: Efficient CHC Solving with Symbolic Reasoning and Inductive Learning" in Python☆11Aug 4, 2024Updated last year
- [ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.☆23Apr 19, 2024Updated last year
- ☆19Apr 22, 2024Updated last year
- Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)☆90Nov 21, 2023Updated 2 years ago
- Learning diverse options through the Laplacian representation.☆23Jan 5, 2024Updated 2 years ago
- HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient☆28Updated this week
- Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…☆12Jan 31, 2023Updated 3 years ago
- CREATE Environment for long-horizon physics-puzzle tasks with diverse tools☆18Nov 22, 2022Updated 3 years ago
- ☆12Sep 7, 2024Updated last year
- Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL☆29Oct 29, 2023Updated 2 years ago
- OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning☆14Jan 27, 2026Updated last month
- On the model-based stochastic value gradient for continuous reinforcement learning☆57Mar 6, 2026Updated 2 weeks ago
- On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning☆16Apr 30, 2023Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆112Dec 5, 2023Updated 2 years ago
- Implementation of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…☆77May 28, 2024Updated last year
- ☆13May 21, 2023Updated 2 years ago
- ☆24Jan 26, 2024Updated 2 years ago
- A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning☆15Oct 22, 2023Updated 2 years ago
- TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"☆19Mar 17, 2022Updated 4 years ago
- Learning bisimulation metrics for control, particularly suited to sparse reward settings☆10Feb 28, 2023Updated 3 years ago
- ☆19Jun 25, 2023Updated 2 years ago
- Online Goal-Conditioned Reinforcement Learning in JAX. ICLR 2025 Spotlight.☆249Nov 23, 2025Updated 3 months ago
- This repository contains the code for RL for POMDPs through learning an Approximate Information State.☆22Nov 29, 2025Updated 3 months ago
- Measuring the Signal to Noise Ratio in Language Model Evaluation☆29Aug 19, 2025Updated 7 months ago
- METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)☆87Oct 15, 2023Updated 2 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆27Jan 14, 2025Updated last year
- ☆42May 11, 2022Updated 3 years ago
- ☆15Mar 26, 2024Updated last year
- Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024☆28Feb 20, 2026Updated last month
- Actor-Sharer-Learner training framework for off-policy DRL algorithms☆22Dec 29, 2024Updated last year
- [NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…☆86Apr 4, 2025Updated 11 months ago
- Implementation of SAC and TD3 based on various RNN and Transformer.☆30Sep 28, 2024Updated last year
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆171Jul 7, 2024Updated last year
- ☆94Jan 21, 2026Updated 2 months ago
- Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023☆22Dec 29, 2023Updated 2 years ago
- Challenging Memory-based Deep Reinforcement Learning Agents☆111Oct 27, 2024Updated last year