RyanNavillus / PPO-v3

Adding Dreamer-v3's implementation tricks to CleanRL's PPO

☆12

Alternatives and similar repositories for PPO-v3

Users who are interested in PPO-v3 are comparing it to the libraries listed below.

luchris429 / discovered-policy-optimisation
Code for Discovered Policy Optimisation (NeurIPS 2022)
☆11 Updated 2 years ago
symoon11 / dreamerv3-flax
Flax Implementation of DreamerV3 on Crafter
☆16 Updated 4 months ago
hr0nix / omega
A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…
☆41 Updated 2 years ago
DHDev0 / Muzero-unplugged
Pytorch Implementation of MuZero Unplugged for gym environment. This algorithm is capable of supporting a wide range of action and observ…
☆27 Updated 3 weeks ago
keraJLi / synthetic-gymnax
Drop-in environment replacements that make your RL algorithm train faster.
☆21 Updated last year
Div99 / XQL
Extreme Q-Learning: Max Entropy RL without Entropy
☆87 Updated 2 years ago
MarcoMeter / neroRL
Deep Reinforcement Learning Framework done with PyTorch
☆37 Updated 4 months ago
kvfrans / fre
Code for "Unsupervised Zero-Shot RL via Functional Reward Representations"
☆57 Updated last year
LAMDA-RL / OfflineRL-Lib
Benchmarked implementations of Offline RL Algorithms.
☆74 Updated 4 months ago
MarcoMeter / endless-memory-gym
Challenging Memory-based Deep Reinforcement Learning Agents
☆101 Updated 8 months ago
brownirl / lambda_discrepancy
Mitigating Partial Observability in Sequential Decision Processes via the Lambda Discrepancy
☆18 Updated 8 months ago
sash-a / CleanRL.jl
Simple single file implementations of Reinforcement Learning algorithms in Julia
☆22 Updated 5 months ago
instadeepai / fastpbrl
Vectorization techniques for fast population-based training.
☆56 Updated 2 years ago
FLAIROx / jaxirl
Contains JAX implementation of algorithms for inverse reinforcement learning
☆73 Updated 10 months ago
luchris429 / popjaxrl
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆107 Updated last year
AlexGoldie / rl-learned-optimization
Official Implementation of "Can Learned Optimization Make Reinforcement Learning Less Difficult"
☆25 Updated 2 months ago
tuero / muzero-cpp
A C++ pytorch implementation of MuZero
☆39 Updated last year
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆17 Updated 2 years ago
dojeon-ai / SimbaV2
Code for "SimbaV2: Hyperspherical Normalization for Scalable Deep Reinforcement Learning"
☆57 Updated last month
jacooba / hyper
Code for the papers Hypernetworks in Meta-Reinforcement Learning (Beck et al., 2022) and Recurrent Hypernetworks are Surprisingly Strong …
☆15 Updated 11 months ago
instadeepai / outer-value-function-meta-rl
Code of the paper: Debiasing Meta-Gradient Reinforcement Learning by Learning the Outer Value Function
☆13 Updated 2 years ago
facebookresearch / gen_dgrl
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆28 Updated 11 months ago
Hwhitetooth / jax_muzero
An implementation of MuZero in JAX.
☆56 Updated 2 years ago
hr0nix / dejax
Accelerated replay buffers in JAX
☆41 Updated 2 years ago
facebookresearch / MRQ
MR.Q is a general-purpose model-free reinforcement learning algorithm.
☆105 Updated 3 weeks ago
hwang-ua / inac_pytorch
☆19 Updated 2 years ago
RobertTLange / gymnax-blines
Baselines for gymnax 🤖
☆67 Updated 2 years ago
twni2016 / Memory-RL
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment, NeurIPS 2023 (oral)
☆63 Updated last year
danijar / crafter-baselines
Docker containers of baseline agents for the Crafter environment
☆28 Updated 3 years ago
bmazoure / ppo_jax
Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…
☆57 Updated 2 years ago