twni2016/Memory-RL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/twni2016/Memory-RL)

twni2016 / Memory-RL

When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)

☆73

Alternatives and similar repositories for Memory-RL

Users that are interested in Memory-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

twni2016 / self-predictive-rl
View on GitHub
Bridging State and History Representations: Understanding Self-Predictive RL, ICLR 2024
☆27Apr 26, 2026Updated 2 months ago
twni2016 / pomdp-baselines
View on GitHub
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆348Apr 26, 2026Updated 2 months ago
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
Chronosymbolic / Chronosymbolic-Learning
View on GitHub
Artifact for paper "Chronosymbolic: Efficient CHC Solving with Symbolic Reasoning and Inductive Learning" in Python
☆11Aug 4, 2024Updated last year
zhihanyang2022 / off-policy-continuous-control
View on GitHub
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆93Nov 21, 2023Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
RajGhugare19 / stitching-is-combinatorial-generalisation
View on GitHub
[ICLR 2024] Closing the Gap between TD Learning and Supervised Learning - A Generalisation Point of View.
☆25Apr 19, 2024Updated 2 years ago
mklissa / dceo
View on GitHub
Learning diverse options through the Laplacian representation.
☆23Jan 5, 2024Updated 2 years ago
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated 2 years ago
holarissun / RewardShifting
View on GitHub
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29Oct 29, 2023Updated 2 years ago
tinkoff-ai / cnf
View on GitHub
Official implementation for "Let Offline RL Flow: Training Conservative Agents in the Latent Space of Normalizing Flows", NeurIPS 2022, O…
☆12Jan 31, 2023Updated 3 years ago
aielawady / relic
View on GitHub
☆12Sep 7, 2024Updated last year
clvrai / create
View on GitHub
CREATE Environment for long-horizon physics-puzzle tasks with diverse tools
☆18Nov 22, 2022Updated 3 years ago
facebookresearch / svg
View on GitHub
On the model-based stochastic value gradient for continuous reinforcement learning
☆58Mar 6, 2026Updated 4 months ago
luchris429 / popjaxrl
View on GitHub
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆116Dec 5, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
frankroeder / lanro-gym
View on GitHub
OpenAI gym environments for goal-conditioned and language-conditioned reinforcement learning
☆14Jan 27, 2026Updated 5 months ago
jon--lee / decision-pretrained-transformer
View on GitHub
Implemention of the Decision-Pretrained Transformer (DPT) from the paper Supervised Pretraining Can Learn In-Context Reinforcement Learni…
☆79May 28, 2024Updated 2 years ago
ec2604 / ContraBAR
View on GitHub
☆13May 21, 2023Updated 3 years ago
mlpc-ucsd / XTRA
View on GitHub
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning
☆16Apr 30, 2023Updated 3 years ago
zhaoyi11 / tcrl
View on GitHub
☆26Jan 26, 2024Updated 2 years ago
metekemertas / RobustBisimulation
View on GitHub
Learning bisimulation metrics for control, particularly suited to sparse reward settings
☆11Feb 28, 2023Updated 3 years ago
Stilwell-Git / Randomized-Return-Decomposition
View on GitHub
TensorFlow implementation for our paper "Learning Long-Term Reward Redistribution via Randomized Return Decomposition"
☆19Mar 17, 2022Updated 4 years ago
hari-sikchi / DVL
View on GitHub
A Dual-RL method DVL: Dual-V Learning for offline and online reinforcement learning
☆16Oct 22, 2023Updated 2 years ago
hwang-ua / inac_pytorch
View on GitHub
☆20Jun 25, 2023Updated 3 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
automl / arlbench
View on GitHub
HPO and Architecture Benchmarking for RL: Dynamically, Reactive and Efficient
☆32Jun 16, 2026Updated last month
info-structures / ais
View on GitHub
This repository contains the code for RL for POMDPs through learning an Approximate Information State.
☆23Nov 29, 2025Updated 7 months ago
orybkin / lexa-benchmark
View on GitHub
☆42May 11, 2022Updated 4 years ago
seohongpark / METRA
View on GitHub
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆92Oct 15, 2023Updated 2 years ago
esraaelelimy / rtus
View on GitHub
Real-Time RTUs
☆12Mar 20, 2026Updated 4 months ago
XinJingHao / Actor-Sharer-Learner
View on GitHub
Actor-Sharer-Learner training framework for off-policy DRL algorithms
☆22Dec 29, 2024Updated last year
facebookresearch / gen_dgrl
View on GitHub
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆29Apr 8, 2026Updated 3 months ago
kevslinger / DTQN
View on GitHub
Deep Transformer Q-Networks for Partially Observable Reinforcement Learning
☆176Jul 7, 2024Updated 2 years ago
mazpie / genrl
View on GitHub
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…
☆87Apr 4, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
samlobel / CFN
View on GitHub
Accompanying Code for "Flipping Coins to Estimate Pseudocounts for Exploration in Reinforcement Learning", ICML 2023
☆25Dec 29, 2023Updated 2 years ago
sukhijab / maxinforl_jax
View on GitHub
☆29Jan 8, 2026Updated 6 months ago
FanmingL / Recurrent-Offpolicy-RL
View on GitHub
Implementation of SAC and TD3 based on various RNN and Transformer.
☆32Sep 28, 2024Updated last year
zwfightzw / Meta-Critic
View on GitHub
☆11Oct 19, 2020Updated 5 years ago
sfujim / TD7
View on GitHub
Author's PyTorch implementation of TD7 for online and offline RL
☆169Sep 12, 2023Updated 2 years ago
a1193095382 / UAV-offloading-with-QMIX
View on GitHub
UAV offloading based on QMIX
☆17Oct 12, 2023Updated 2 years ago
DramaCow / jaxued
View on GitHub
☆98Jan 21, 2026Updated 6 months ago