Challenging Memory-based Deep Reinforcement Learning Agents
☆113Oct 27, 2024Updated last year
Alternatives and similar repositories for endless-memory-gym
Users that are interested in endless-memory-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆210Jun 18, 2024Updated last year
- Baseline implementation of recurrent PPO using truncated BPTT☆161Apr 28, 2024Updated 2 years ago
- Partially Observable Process Gym☆218Updated this week
- Evaluating long-term memory of reinforcement learning algorithms☆176Jun 23, 2023Updated 2 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆43Mar 12, 2025Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆63Aug 3, 2023Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆239Nov 24, 2025Updated 6 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆23Oct 28, 2024Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆57May 21, 2023Updated 3 years ago
- ☆10Jun 27, 2024Updated last year
- JAX implementation of RL algorithms and vectorized environments☆51Dec 26, 2023Updated 2 years ago
- Flax Implementation of DreamerV3 on Crafter☆18Nov 29, 2025Updated 5 months ago
- ☆96Jan 21, 2026Updated 4 months ago
- Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025☆52May 23, 2025Updated last year
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆38May 19, 2023Updated 3 years ago
- ☆95Feb 16, 2026Updated 3 months ago
- ☆19Mar 1, 2023Updated 3 years ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!☆265Oct 31, 2025Updated 6 months ago
- Clean single-file implementation of offline RL algorithms in JAX☆177Nov 24, 2025Updated 6 months ago
- ☆19Nov 25, 2022Updated 3 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆74Aug 31, 2024Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 3 years ago
- Accelerated minigrid environments with JAX☆170Oct 20, 2025Updated 7 months ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- ☆23Aug 19, 2022Updated 3 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆28Jan 14, 2025Updated last year
- An API conversion tool for popular external reinforcement learning environments☆210May 10, 2026Updated 2 weeks ago
- ExORL: Exploratory Data for Offline Reinforcement Learning☆132Feb 8, 2022Updated 4 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆127Aug 30, 2024Updated last year
- ☆55Feb 28, 2024Updated 2 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 4 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆406Feb 10, 2026Updated 3 months ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- Jaxpr Visualisation Tool☆36Dec 22, 2024Updated last year
- ☆36Nov 22, 2024Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆115Apr 16, 2026Updated last month
- Distrax, but in equinox. Lightweight JAX library of probability distributions and bijectors.☆44Apr 25, 2026Updated 3 weeks ago
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆35Mar 29, 2024Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago