Clean baseline implementation of PPO using an episodic TransformerXL memory
☆205Jun 18, 2024Updated last year
Alternatives and similar repositories for episodic-transformer-memory-ppo
Users that are interested in episodic-transformer-memory-ppo are comparing it to the libraries listed below
Sorting:
- Baseline implementation of recurrent PPO using truncated BPTT☆160Apr 28, 2024Updated last year
- Challenging Memory-based Deep Reinforcement Learning Agents☆111Oct 27, 2024Updated last year
- ☆92Feb 16, 2026Updated 2 weeks ago
- Deep Reinforcement Learning Framework done with PyTorch☆43Mar 12, 2025Updated 11 months ago
- A Reinforcement Learning Project using PPO + Transformer☆86Jul 21, 2023Updated 2 years ago
- Brain Agent for Large-Scale and Multi-Task Agent Learning☆93Jan 4, 2024Updated 2 years ago
- 🔥Benchmarking of Neural Network Architectures in Reinforcement Learning.☆33Jan 22, 2026Updated last month
- AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)☆23Oct 15, 2024Updated last year
- An easy PyTorch implementation of "Stabilizing Transformers for Reinforcement Learning"☆183Feb 21, 2023Updated 3 years ago
- Deep Transformer Q-Networks for Partially Observable Reinforcement Learning☆171Jul 7, 2024Updated last year
- ☆59Sep 22, 2022Updated 3 years ago
- Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022☆343Aug 22, 2024Updated last year
- CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery☆83Jul 27, 2022Updated 3 years ago
- Implementation of Proximal Policy Optimization in Jax+Flax☆21May 18, 2023Updated 2 years ago
- Bipedal Skills Benchmark for Reinforcement Learning☆25Oct 27, 2022Updated 3 years ago
- Evaluating long-term memory of reinforcement learning algorithms☆164Jun 23, 2023Updated 2 years ago
- Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]☆112Dec 5, 2023Updated 2 years ago
- Official code for TLDR: Unsupervised Goal-Conditioned RL via Temporal Distance-Aware Representations☆36Jan 24, 2026Updated last month
- ☆19Nov 25, 2022Updated 3 years ago
- This repository is a collection of widely used self-supervised auxiliary losses used for learning representations in reinforcement learni…☆14Feb 27, 2023Updated 3 years ago
- Adaptive Attention Span for Reinforcement Learning☆136May 11, 2020Updated 5 years ago
- The implementation of Discriminator Soft Actor Critic☆15Jan 25, 2020Updated 6 years ago
- Generalised UDRL☆37May 12, 2022Updated 3 years ago
- RL Environments in JAX 🌍☆868May 30, 2025Updated 9 months ago
- (Crafter + NetHack) in JAX. ICML 2024 Spotlight.☆374Feb 10, 2026Updated 3 weeks ago
- Minimal Decision Transformer Implementation written in Jax (Flax).☆17Aug 8, 2022Updated 3 years ago
- Pytorch implementation of "Maximum a Posteriori Policy Optimization" with Retrace for Discrete gym environments☆29Sep 10, 2020Updated 5 years ago
- Advantage weighted Actor Critic for Offline RL☆52Aug 27, 2022Updated 3 years ago
- Dream to Control: Learning Behaviors by Latent Imagination, implemented in PyTorch.☆321Jan 11, 2024Updated 2 years ago
- Docker containers of baseline agents for the Crafter environment☆30Dec 14, 2021Updated 4 years ago
- Partially Observable Process Gym☆212Jun 12, 2025Updated 8 months ago
- PyTorch implementation for all methods and environments in the paper "MIMEx: Intrinsic Rewards from Masked Input Modeling"☆16May 17, 2023Updated 2 years ago
- An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.☆18Jan 16, 2023Updated 3 years ago
- Gym wrapper for pysc2☆10Sep 16, 2022Updated 3 years ago
- 3rd placed submission to the NeurIPS MineRL competition 2019☆10Mar 24, 2023Updated 2 years ago
- Official implementation of Recurrent Action Transformer with Memory, an offline RL agent with memory mechanisms. https://sites.google.com…☆18Nov 23, 2025Updated 3 months ago
- Codebase for the paper "How Crucial is Transformer in Decision Transformer?". Containing experiments on different pendulum tasks and code…☆28Mar 24, 2023Updated 2 years ago
- Really Fast End-to-End Jax RL Implementations☆1,028Sep 9, 2024Updated last year
- Datasets for data-driven deep reinforcement learning with Atari (wrapper for datasets released by Google)☆126Aug 30, 2024Updated last year