Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights on all environments.
☆59 · Aug 4, 2022 · Updated 3 years ago
Alternatives and similar repositories for ppo_jax
Users that are interested in ppo_jax are comparing it to the libraries listed below.
- JAX implementations of core Deep RL algorithms ☆84 · May 2, 2022 · Updated 3 years ago
- ☆18 · Mar 18, 2026 · Updated last week
- JAX code for the paper "Control-Oriented Model-Based Reinforcement Learning with Implicit Differentiation" ☆44 · Jun 14, 2021 · Updated 4 years ago
- JAX implementations of various deep reinforcement learning algorithms. ☆26 · Feb 2, 2025 · Updated last year
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces. ☆754 · Oct 26, 2022 · Updated 3 years ago
- Various reinforcement learning algorithms written in Jax + Flax ☆26 · Jun 24, 2023 · Updated 2 years ago
- RL Environments in JAX 🌍 ☆873 · May 30, 2025 · Updated 9 months ago
- A collection of RL algorithms written in JAX. ☆105 · Jul 5, 2022 · Updated 3 years ago
- Collection of resources on plasticity loss in deep reinforcement learning ☆23 · Nov 12, 2024 · Updated last year
- A2C training of Relational Deep Reinforcement Learning Architecture ☆13 · Jun 22, 2022 · Updated 3 years ago
- 📴 OffCon^3: SOTA PyTorch SAC and TD3 Implementations (arxiv: 2101.11331) ☆25 · Jun 20, 2021 · Updated 4 years ago
- ☆53 · Jan 20, 2023 · Updated 3 years ago
- ⚡ Flashbax: Accelerated Replay Buffers in JAX ☆275 · Sep 22, 2025 · Updated 6 months ago
- POPGym Library in JAX ☆12 · Apr 15, 2024 · Updated last year
- MuJoCo models for Unitree Robots ☆12 · Nov 24, 2021 · Updated 4 years ago
- Action Value Gradient Algorithm ☆28 · May 18, 2025 · Updated 10 months ago
- Code that can be used to reproduce the experiments in our paper "Estimating Risk and Uncertainty in Deep Reinforcement Learning" ☆31 · Nov 22, 2022 · Updated 3 years ago
- Revisiting Rainbow ☆76 · Jun 9, 2021 · Updated 4 years ago
- Standard interface for entity based reinforcement learning environments. ☆38 · Feb 28, 2024 · Updated 2 years ago
- Modular framework for Reinforcement Learning in python ☆184 · Feb 1, 2023 · Updated 3 years ago
- Extending JAX with custom C++ and CUDA code ☆403 · Aug 18, 2024 · Updated last year
- ☆15 · Jul 1, 2021 · Updated 4 years ago
- ☆13 · Aug 9, 2022 · Updated 3 years ago
- SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms ☆576 · Feb 25, 2026 · Updated last month
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin… ☆65 · Jan 2, 2026 · Updated 2 months ago
- Rainbow DQN implementation accompanying the paper "Fast and Data-Efficient Training of Rainbow" which reaches 205.7 median HNS after 10M … ☆44 · Dec 11, 2021 · Updated 4 years ago
- C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments. ☆1,297 · Updated this week
- Simple single-file baselines for Q-Learning in pure-GPU setting ☆238 · Nov 24, 2025 · Updated 4 months ago
- Curated list of JAX Resources and Packages ☆35 · Mar 2, 2026 · Updated 3 weeks ago
- Code for the paper "Leveraging Procedural Generation to Benchmark Reinforcement Learning" ☆181 · Apr 2, 2023 · Updated 2 years ago
- krazy grid world ☆25 · Mar 2, 2020 · Updated 6 years ago
- ☆13 · Aug 28, 2025 · Updated 7 months ago
- ☆1,413 · Mar 2, 2026 · Updated 3 weeks ago
- ☆35 · Nov 22, 2024 · Updated last year
- Generalised UDRL ☆37 · May 12, 2022 · Updated 3 years ago
- Baselines for gymnax 🤖 ☆75 · Apr 3, 2023 · Updated 2 years ago
- 🪐 The Sebulba architecture to scale reinforcement learning on Cloud TPUs in JAX ☆62 · Oct 23, 2023 · Updated 2 years ago
- 🏛️ A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL ☆400 · Mar 18, 2026 · Updated last week
- A2C is a special case of PPO! ☆22 · May 20, 2022 · Updated 3 years ago