MarcoMeter/endless-memory-gym

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MarcoMeter/endless-memory-gym)

MarcoMeter / endless-memory-gym

Challenging Memory-based Deep Reinforcement Learning Agents

☆114

Alternatives and similar repositories for endless-memory-gym

Users that are interested in endless-memory-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

MarcoMeter / episodic-transformer-memory-ppo
View on GitHub
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆212Jun 18, 2024Updated 2 years ago
MarcoMeter / recurrent-ppo-truncated-bptt
View on GitHub
Baseline implementation of recurrent PPO using truncated BPTT
☆161Apr 28, 2024Updated 2 years ago
proroklab / popgym
View on GitHub
Partially Observable Process Gym
☆227Jun 11, 2026Updated last month
jurgisp / memory-maze
View on GitHub
Evaluating long-term memory of reinforcement learning algorithms
☆180Jun 23, 2023Updated 3 years ago
MarcoMeter / neroRL
View on GitHub
Deep Reinforcement Learning Framework done with PyTorch
☆43Mar 12, 2025Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
tinkoff-ai / ReBRAC
View on GitHub
Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC
☆63Aug 3, 2023Updated 2 years ago
mttga / purejaxql
View on GitHub
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242Nov 24, 2025Updated 8 months ago
Howuhh / sac-n-jax
View on GitHub
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆56May 21, 2023Updated 3 years ago
brownirl / lambda_discrepancy
View on GitHub
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆24Oct 28, 2024Updated last year
aliang8 / varibad_jax
View on GitHub
☆10Jun 27, 2024Updated 2 years ago
RPegoud / jym
View on GitHub
JAX implementation of RL algorithms and vectorized environments
☆50Dec 26, 2023Updated 2 years ago
symoon11 / dreamerv3-flax
View on GitHub
Flax Implementation of DreamerV3 on Crafter
☆18Nov 29, 2025Updated 7 months ago
DramaCow / jaxued
View on GitHub
☆98Jan 21, 2026Updated 6 months ago
danijar / diamond_env
View on GitHub
Standardized Minecraft Diamond Environment for Reinforcement Learning
☆40May 19, 2023Updated 3 years ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
dunnolab / vintix
View on GitHub
Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025
☆51May 23, 2025Updated last year
Reytuag / transformerXL_PPO_JAX
View on GitHub
☆96Feb 16, 2026Updated 5 months ago
bolt-research / popgym-arcade
View on GitHub
Atari-style POMDPs
☆34Jun 5, 2026Updated last month
google-deepmind / nao_top10
View on GitHub
☆19Mar 1, 2023Updated 3 years ago
dunnolab / NinA
View on GitHub
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17Sep 22, 2025Updated 10 months ago
taodav / pobax
View on GitHub
Partially Observable Benchmarks in JAX
☆25Apr 30, 2026Updated 2 months ago
keraJLi / rejax
View on GitHub
Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!
☆274Jun 10, 2026Updated last month
nissymori / JAX-CORL
View on GitHub
Clean single-file implementation of offline RL algorithms in JAX
☆182Jun 5, 2026Updated last month
astanic / crafter-ood
View on GitHub
☆19Nov 25, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
max7born / decision-lstm
View on GitHub
Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…
☆28Mar 24, 2023Updated 3 years ago
kvfrans / powderworld
View on GitHub
Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions
☆74Aug 31, 2024Updated last year
epignatelli / navix
View on GitHub
Accelerated minigrid environments with JAX
☆175Oct 20, 2025Updated 9 months ago
Howuhh / streaming-drl-jax
View on GitHub
streaming deep reinforcement learning but 4x faster with jax!
☆19Jan 4, 2026Updated 6 months ago
corl-team / katakomba
View on GitHub
Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)
☆43Aug 22, 2023Updated 2 years ago
Farama-Foundation / Shimmy
View on GitHub
PettingZoo and Gymnasium bindings for popular reinforcement learning environments outside of Farama
☆225Updated this week
ikostrikov / dmcgym
View on GitHub
☆23Aug 19, 2022Updated 3 years ago
takuseno / d4rl-atari
View on GitHub
Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)
☆128Aug 30, 2024Updated last year
sparisi / cbet
View on GitHub
Change-Based Exploration Transfer
☆35Apr 24, 2022Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
denisyarats / exorl
View on GitHub
ExORL: Exploratory Data for Offline Reinforcement Learning
☆138Feb 8, 2022Updated 4 years ago
rraileanu / idaac
View on GitHub
☆55Feb 28, 2024Updated 2 years ago
MichaelTMatthews / Craftax
View on GitHub
(Crafter + NetHack) in JAX. ICML 2024 Spotlight.
☆426Jun 20, 2026Updated last month
lockwo / distreqx
View on GitHub
Distrax, but in equinox. Lightweight JAX library of probability distributions and bijectors.
☆47Jul 10, 2026Updated last week
radarFudan / mamba-minimal-jax
View on GitHub
☆36Nov 22, 2024Updated last year
frt03 / jax_dt
View on GitHub
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18Aug 8, 2022Updated 3 years ago
seohongpark / HILP
View on GitHub
Foundation Policies with Hilbert Representations (ICML 2024)
☆104Sep 29, 2025Updated 9 months ago