bmazoure / ppo_jaxLinks

Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights on all environments.

☆57

Alternatives and similar repositories for ppo_jax

Users that are interested in ppo_jax are comparing it to the libraries listed below

Sorting:

henry-prior / jax-rl
JAX implementations of core Deep RL algorithms
☆81Updated 3 years ago
RobertTLange / gymnax-blines
Baselines for gymnax 🤖
☆68Updated 2 years ago
danijar / ninjax
General Modules for JAX
☆66Updated 3 months ago
toshikwa / rljax
A collection of RL algorithms written in JAX.
☆102Updated 3 years ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆56Updated 2 years ago
vwxyzjn / cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
☆114Updated 11 months ago
DramaCow / jaxued
☆82Updated 4 months ago
hr0nix / dejax
Accelerated replay buffers in JAX
☆43Updated 2 years ago
facebookresearch / level-replay
This code implements Prioritized Level Replay, a method for sampling training levels for reinforcement learning agents that exploits the …
☆88Updated 4 years ago
tinker495 / jax-baseline
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆55Updated 2 months ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56Updated 2 years ago
jurgisp / memory-maze
Evaluating long-term memory of reinforcement learning algorithms
☆146Updated 2 years ago
evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆100Updated 3 years ago
johanobandoc / revisiting_rainbow
Revisiting Rainbow
☆75Updated 4 years ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆73Updated 11 months ago
RajGhugare19 / alm
Simplifying Model-based RL: Learning Representations, Latent-space Models and Policies with One Objective
☆81Updated 2 years ago
facebookresearch / dcd
Implementations of robust Dual Curriculum Design (DCD) algorithms for unsupervised environment design.
☆133Updated 11 months ago
ethanluoyc / magi
Reinforcement learning library in JAX.
☆100Updated last year
facebookresearch / impact-driven-exploration
impact-driven-exploration
☆131Updated last year
ahmed-touati / controllable_agent
☆47Updated 2 years ago
ucl-dark / paired
PAIRED in PyTorch 🔥
☆62Updated 2 years ago
Reytuag / transformerXL_PPO_JAX
☆81Updated 9 months ago
rll-research / cic
CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery
☆81Updated 3 years ago
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆28Updated 3 years ago
yfletberliac / adversarially-guided-actor-critic
AGAC: Adversarially Guided Actor-Critic
☆48Updated 3 years ago
ucl-dark / skillhack
SkillHack: A Benchmark for Skill Transfer in Open-Ended Reinforcement Learning
☆17Updated 2 years ago
denisyarats / proto
Proto-RL: Reinforcement Learning with Prototypical Representations
☆82Updated 3 years ago
google-deepmind / csuite
☆44Updated 10 months ago
Egiob / DiversityIsAllYouNeed-SB3
Implementation of Diversity Is All You Need (DIAYN) on top of Stable Baselines 3.
☆12Updated 3 years ago
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆174Updated 4 months ago