subho406/Recurrent-PPO-Jax

subho406 / Recurrent-PPO-Jax

Implementation of Proximal Policy Optimization in Jax+Flax

☆21

Alternatives and similar repositories for Recurrent-PPO-Jax

Users who are interested in Recurrent-PPO-Jax are comparing it to the libraries listed below.

symoon11 / dreamerv3-flax
Flax Implementation of DreamerV3 on Crafter
☆18 · Nov 29, 2025 · Updated 7 months ago
luchris429 / discovered-policy-optimisation
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆12 · Jun 15, 2023 · Updated 3 years ago
subho406 / pytorch2jax
Pytorch2Jax is a small Python library that provides functions that wrap PyTorch models into Jax functions and Flax modules.
☆21 · Feb 20, 2023 · Updated 3 years ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
hsing-wang / WMT2020_BioMedical
☆15 · Jul 16, 2021 · Updated 5 years ago
aliang8 / varibad_jax
☆10 · Jun 27, 2024 · Updated 2 years ago
kenjyoung / dreamerv2_JAX
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18 · Jan 16, 2023 · Updated 3 years ago
taodav / pobax
Partially Observable Benchmarks in JAX
☆25 · Apr 30, 2026 · Updated 2 months ago
qfettes / CuriosityDrivenExplorationBySelfSupervisedPrediction
Reproduction of Curiosity-driven Exploration by Self-supervised Prediction in PyTorch
☆12 · Jun 10, 2019 · Updated 7 years ago
yun-kwak / decision-transformer-jax
Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku
☆13 · Aug 14, 2024 · Updated last year
thiagolopes97 / A-Star-Based-Algorithm-Applied-to-Target-Search-and-Rescue-by-an-UAV-Swarm
☆16 · Sep 28, 2022 · Updated 3 years ago
radarFudan / mamba-minimal-jax
☆36 · Nov 22, 2024 · Updated last year
FLAIROx / cultural-accumulation
☆16 · Jul 16, 2024 · Updated 2 years ago
MANGA-UOFA / PTfer
☆11 · Nov 13, 2024 · Updated last year
tinkoff-ai / sac-rnd
Official implementation for "Anti-Exploration by Random Network Distillation", ICML 2023
☆58 · Feb 3, 2023 · Updated 3 years ago
RyanNavillus / reward-surfaces
☆19 · Apr 22, 2024 · Updated 2 years ago
google-deepmind / csuite
☆47 · Updated this week
wxjiao / Data-Rejuvenation
Implementation of our paper "Data Rejuvenation: Exploiting Inactive Training Examples for Neural Machine Translation" in EMNLP-2020.
☆23 · Aug 20, 2021 · Updated 4 years ago
uoe-agents / CMID
☆13 · Apr 25, 2024 · Updated 2 years ago
MarcoMeter / recurrent-ppo-truncated-bptt
Baseline implementation of recurrent PPO using truncated BPTT
☆161 · Apr 28, 2024 · Updated 2 years ago
subho406 / agalite
AGaLiTe: Approximate Gated Linear Transformers for Online Reinforcement Learning (Published in TMLR)
☆24 · Oct 15, 2024 · Updated last year
RyanNavillus / PPO-v3
Adding Dreamer-v3's implementation tricks to CleanRL's PPO
☆16 · May 19, 2023 · Updated 3 years ago
NishanthVAnand / prediction-and-control-in-continual-reinforcement-learning
Code to reproduce results from the paper: Prediction and Control in Continual Reinforcement Learning, NeurIPS 2023.
☆13 · May 10, 2024 · Updated 2 years ago
holarissun / RewardShifting
Code for NeurIPS 2022 paper Exploiting Reward Shifting in Value-Based Deep RL
☆29 · Oct 29, 2023 · Updated 2 years ago
MANGA-UOFA / Prompt-Edit
An official implementation for the EMNLP 2023 Findings paper "Prompt-Based Editing for Text Style Transfer"
☆13 · Dec 9, 2023 · Updated 2 years ago
cswinter / hyperstate
Opinionated library for managing hyperparameters and mutable state of machine learning training systems.
☆19 · Aug 4, 2023 · Updated 2 years ago
lucasnfe / puct-music-emotion
☆15 · Nov 28, 2022 · Updated 3 years ago
RPegoud / jym
JAX implementation of RL algorithms and vectorized environments
☆50 · Dec 26, 2023 · Updated 2 years ago
wxjiao / Pre-CODE
Implementation of our paper "Exploiting Unsupervised Data for Emotion Recognition in Conversations" in the Findings of EMNLP-2020.
☆13 · Nov 17, 2020 · Updated 5 years ago
abhanjac / robots_for_industry_4.0_tasks
Using mobile robots in an industry 4.0 setting for working alongside human operators and assisting them to increase the efficiency of man…
☆11 · Dec 12, 2022 · Updated 3 years ago
AGI-Labs / continual_rl
Continual reinforcement learning baselines: experiment specifications, implementation of existing methods, and common metrics. Easily ext…
☆137 · Jul 6, 2023 · Updated 3 years ago
abhayraw1 / planet-torch
A PyTorch Implementation of PlaNet: A Deep Planning Network for Reinforcement Learning
☆13 · Aug 31, 2020 · Updated 5 years ago
xinyiz0931 / bin-picking-robot
Binpicking tool including vision, planning and robot control
☆13 · Jan 17, 2024 · Updated 2 years ago
google-deepmind / nao_top10
☆19 · Mar 1, 2023 · Updated 3 years ago
notmahi / disk
PyTorch implementation for "Discovery of Incremental Skills" (DISk) algorithm from ICLR 2022 paper "One After Another: Learning Increment…
☆21 · Mar 22, 2022 · Updated 4 years ago
mkschleg / GVFN
☆10 · Apr 24, 2021 · Updated 5 years ago
StoneT2000 / robojax
A high-performance reinforcement learning library in jax specialized for robotic learning
☆22 · Sep 4, 2023 · Updated 2 years ago
dunnolab / NinA
Official implementation of "NinA: Normalizing Flows in Action. Training VLA Models with Normalizing Flows"
☆17 · Sep 22, 2025 · Updated 10 months ago
wdlctc / recurrent_maskable
☆17 · May 7, 2023 · Updated 3 years ago