Challenging Memory-based Deep Reinforcement Learning Agents
☆112Oct 27, 2024Updated last year
Alternatives and similar repositories for endless-memory-gym
Users that are interested in endless-memory-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆206Jun 18, 2024Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆160Apr 28, 2024Updated last year
- Partially Observable Process Gym☆213Jun 12, 2025Updated 9 months ago
- Evaluating long-term memory of reinforcement learning algorithms☆165Jun 23, 2023Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆43Mar 12, 2025Updated last year
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆62Aug 3, 2023Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆238Nov 24, 2025Updated 4 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 2 years ago
- ☆10Jun 27, 2024Updated last year
- JAX implementation of RL algorithms and vectorized environments☆51Dec 26, 2023Updated 2 years ago
- Flax Implementation of DreamerV3 on Crafter☆18Nov 29, 2025Updated 3 months ago
- ☆94Jan 21, 2026Updated 2 months ago
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆37May 19, 2023Updated 2 years ago
- Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025☆45May 23, 2025Updated 10 months ago
- ☆93Feb 16, 2026Updated last month
- ☆19Mar 1, 2023Updated 3 years ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!☆264Oct 31, 2025Updated 4 months ago
- Clean single-file implementation of offline RL algorithms in JAX☆174Nov 24, 2025Updated 4 months ago
- ☆19Nov 25, 2022Updated 3 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆74Aug 31, 2024Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 3 years ago
- Accelerated minigrid environments with JAX☆163Oct 20, 2025Updated 5 months ago
- ☆23Aug 19, 2022Updated 3 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆27Jan 14, 2025Updated last year
- An API conversion tool for popular external reinforcement learning environments☆205Dec 15, 2025Updated 3 months ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆129Feb 8, 2022Updated 4 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆126Aug 30, 2024Updated last year
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆381Feb 10, 2026Updated last month
- ☆55Feb 28, 2024Updated 2 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 3 years ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- Jaxpr Visualisation Tool☆36Dec 22, 2024Updated last year
- Distrax, but in equinox. Lightweight JAX library of probability distributions and bijectors.☆41Jan 16, 2026Updated 2 months ago
- ☆35Nov 22, 2024Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆112May 27, 2024Updated last year
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆35Mar 29, 2024Updated last year
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago