MarcoMeter/recurrent-ppo-truncated-bptt

Baseline implementation of recurrent PPO using truncated BPTT

☆161

Alternatives and similar repositories for recurrent-ppo-truncated-bptt

Users who are interested in recurrent-ppo-truncated-bptt are comparing it to the libraries listed below.

MarcoMeter / episodic-transformer-memory-ppo
Clean baseline implementation of PPO using an episodic TransformerXL memory
☆212 · Jun 18, 2024 · Updated 2 years ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆113 · Oct 27, 2024 · Updated last year
MarcoMeter / neroRL
Deep Reinforcement Learning Framework done with PyTorch
☆43 · Mar 12, 2025 · Updated last year
siekmanj / r2l
Recurrent continuous reinforcement learning algorithms implemented in Pytorch.
☆52 · May 26, 2021 · Updated 5 years ago
ovechkin-dm / ppo-lstm-parallel
ppo-lstm-parallel
☆49 · Mar 26, 2019 · Updated 7 years ago
subho406 / Recurrent-PPO-Jax
Implementation of Proximal Policy Optimization in Jax+Flax
☆21 · May 18, 2023 · Updated 3 years ago
roger-creus / Wave-Defense-Learning-Environment
A videogame made with PyGame turned into an Open AI Gym Learning Environment for Reinforcement Learning agents.
☆14 · Jan 3, 2023 · Updated 3 years ago
subho406 / agalite
AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)
☆24 · Oct 15, 2024 · Updated last year
datvodinh / recurrent-ppo
A Reinforcement Learning Project using PPO + LSTM
☆113 · Jul 30, 2023 · Updated 2 years ago
YangShengqi / cartpole_ppo_lstm
☆13 · Jun 1, 2020 · Updated 6 years ago
kngwyu / Rainy
Deep RL agents with PyTorch
☆35 · Sep 25, 2021 · Updated 4 years ago
Reytuag / transformerXL_PPO_JAX
☆96 · Feb 16, 2026 · Updated 4 months ago
SafeRL-Lab / BenchNetRL
Benchmarking of Neural Network Architectures in Reinforcement Learning.
☆39 · Jan 22, 2026 · Updated 5 months ago
Howuhh / sac-n-jax
Single-file SAC-N implementation on jax with flax and equinox. 10x faster than pytorch
☆56 · May 21, 2023 · Updated 3 years ago
symoon11 / dreamerv3-flax
Flax Implementation of DreamerV3 on Crafter
☆18 · Nov 29, 2025 · Updated 7 months ago
facebookresearch / svg
On the model-based stochastic value gradient for continuous reinforcement learning
☆58 · Mar 6, 2026 · Updated 4 months ago
akazemipour / PPO-RND
Random network distillation on Montezuma's Revenge and Super Mario Bros.
☆55 · May 12, 2025 · Updated last year
vwxyzjn / ppo-implementation-details
The source code for the blog post The 37 Implementation Details of Proximal Policy Optimization
☆942 · Mar 23, 2024 · Updated 2 years ago
zhihanyang2022 / off-policy-continuous-control
Official PyTorch code for "Recurrent Off-policy Baselines for Memory-based Continuous Control" (DeepRL Workshop, NeurIPS 21)
☆93 · Nov 21, 2023 · Updated 2 years ago
hamishs / JAX-RL
JAX implementations of various deep reinforcement learning algorithms.
☆25 · Feb 2, 2025 · Updated last year
twni2016 / pomdp-baselines
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022
☆348 · Apr 26, 2026 · Updated 2 months ago
Improbable-AI / eipo
Official codebase for Redeeming Intrinsic Rewards via Constrained Policy Optimization
☆83 · Apr 13, 2023 · Updated 3 years ago
Stable-Baselines-Team / stable-baselines3-contrib
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code
☆727 · Jun 19, 2026 · Updated 2 weeks ago
jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆180 · Jun 23, 2023 · Updated 3 years ago
chscheller / minerl_agent
3rd placed submission to the NeurIPS MineRL competition 2019
☆10 · Mar 24, 2023 · Updated 3 years ago
TakuyaHiraoka / Dropout-Q-Functions-for-Doubly-Efficient-Reinforcement-Learning
Source files to replicate experiments in my ICLR 2022 paper.
☆74 · Jul 17, 2025 · Updated 11 months ago
lcswillems / torch-ac
Recurrent and multi-process PyTorch implementation of deep reinforcement Actor-Critic algorithms A2C and PPO
☆208 · Oct 5, 2022 · Updated 3 years ago
jlin816 / homegrid
A minimal home grid world environment to evaluate language understanding in interactive agents.
☆24 · Sep 6, 2023 · Updated 2 years ago
seungyulhan / disc
☆10 · Aug 17, 2022 · Updated 3 years ago
alex-petrenko / sample-factory
High throughput synchronous and asynchronous reinforcement learning
☆1,006 · Jul 2, 2026 · Updated last week
openai / phasic-policy-gradient
Code for the paper "Phasic Policy Gradient"
☆266 · Apr 2, 2023 · Updated 3 years ago
entity-neural-network / entity-gym
Standard interface for entity based reinforcement learning environments.
☆39 · Feb 28, 2024 · Updated 2 years ago
stweigand / gym-pomdp-wrappers
POMDP wrappers for OpenAI Gym
☆15 · Nov 4, 2019 · Updated 6 years ago
byronbenharris / reinforcement-learning-trajectory-optimization
An AI agent that uses Deep Q-Networks and the DDPG algorithm to learn trajectory optimization in a customized gym environment.
☆13 · Oct 30, 2021 · Updated 4 years ago
AboudyKreidieh / h-baselines
A repository of high-performing hierarchical reinforcement learning models and algorithms.
☆339 · Mar 24, 2023 · Updated 3 years ago
uzh-rpg / sitt
Repository relating to "Student-Informed Teacher Training" (ICLR, 2025).
☆47 · Feb 27, 2025 · Updated last year
Coac / never-give-up
PyTorch implementation of Never Give Up: Learning Directed Exploration Strategies
☆57 · Jan 22, 2021 · Updated 5 years ago
alantess / gtrxl-torch
Gated Transformer Model for Computer Vision
☆25 · Jul 11, 2021 · Updated 4 years ago
henry-prior / jax-rl
JAX implementations of core Deep RL algorithms
☆84 · May 2, 2022 · Updated 4 years ago