Challenging Memory-based Deep Reinforcement Learning Agents
☆113Oct 27, 2024Updated last year
Alternatives and similar repositories for endless-memory-gym
Users that are interested in endless-memory-gym are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Clean baseline implementation of PPO using an episodic TransformerXL memory☆212Jun 18, 2024Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆161Apr 28, 2024Updated 2 years ago
- Partially Observable Process Gym☆225Jun 11, 2026Updated 3 weeks ago
- Evaluating long-term memory of reinforcement learning algorithms☆180Jun 23, 2023Updated 3 years ago
- Deep Reinforcement Learning Framework done with PyTorch☆43Mar 12, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Author's implementation of ReBRAC, a minimalist improvement upon TD3+BC☆63Aug 3, 2023Updated 2 years ago
- Simple single-file baselines for Q-Learning in pure-GPU setting☆243Nov 24, 2025Updated 7 months ago
- Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy☆24Oct 28, 2024Updated last year
- Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch☆56May 21, 2023Updated 3 years ago
- ☆10Jun 27, 2024Updated 2 years ago
- JAX implementation of RL algorithms and vectorized environments☆50Dec 26, 2023Updated 2 years ago
- Flax Implementation of DreamerV3 on Crafter☆18Nov 29, 2025Updated 7 months ago
- ☆98Jan 21, 2026Updated 5 months ago
- Vintix: Action Model via In-Context Reinforcement Learning - - — ICML 2025☆51May 23, 2025Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Standardized Minecraft Diamond Environment for Reinforcement Learning☆39May 19, 2023Updated 3 years ago
- ☆96Feb 16, 2026Updated 4 months ago
- ☆19Mar 1, 2023Updated 3 years ago
- Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!☆271Jun 10, 2026Updated 3 weeks ago
- Clean single-file implementation of offline RL algorithms in JAX☆182Jun 5, 2026Updated 3 weeks ago
- ☆19Nov 25, 2022Updated 3 years ago
- Code for Powderworld: A Platform for Understanding Generalization via Rich Task Distributions☆74Aug 31, 2024Updated last year
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 3 years ago
- Accelerated minigrid environments with JAX☆171Oct 20, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆23Aug 19, 2022Updated 3 years ago
- VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)☆29Jan 14, 2025Updated last year
- An API conversion tool for popular external reinforcement learning environments☆213Jun 24, 2026Updated last week
- ExORL: Exploratory Data for Offline Reinforcement Learning☆136Feb 8, 2022Updated 4 years ago
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆128Aug 30, 2024Updated last year
- ☆55Feb 28, 2024Updated 2 years ago
- Change-Based Exploration Transfer☆35Apr 24, 2022Updated 4 years ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆418Jun 20, 2026Updated 2 weeks ago
- Data-Driven NetHack Tools: Datasets (30+) and recurrent-baselines (AWAC, BC, CQL, IQL, REM)☆43Aug 22, 2023Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆24Oct 15, 2024Updated last year
- Jaxpr Visualisation Tool☆37Dec 22, 2024Updated last year
- ☆36Nov 22, 2024Updated last year
- Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations☆115Apr 16, 2026Updated 2 months ago
- Distrax, but in equinox. Lightweight JAX library of probability distributions and bijectors.☆47Jun 15, 2026Updated 2 weeks ago
- (NeurIPS 2023) Residual Q-Learning: Offline and Online Policy Customization without Value☆35Mar 29, 2024Updated 2 years ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆18Aug 8, 2022Updated 3 years ago