JAX implementations of various deep reinforcement learning algorithms.
☆26Feb 2, 2025Updated last year
Alternatives and similar repositories for JAX-RL
Users that are interested in JAX-RL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Various reinforcement learning algorithms written in Jax + Flax☆26Jun 24, 2023Updated 2 years ago
- JAX implementation of RL algorithms and vectorized environments☆51Dec 26, 2023Updated 2 years ago
- JAX implementations of core Deep RL algorithms☆84May 2, 2022Updated 3 years ago
- Jax implementation of Proximal Policy Optimization (PPO) specifically tuned for Procgen, with benchmarked results and saved model weights…☆59Aug 4, 2022Updated 3 years ago
- Autoregressive transformer in JAX from scratch☆23Jan 28, 2022Updated 4 years ago
- A JAX Implementation of the Twin Delayed DDPG Algorithm☆35Mar 12, 2020Updated 6 years ago
- A collection of RL algorithms written in JAX.☆105Jul 5, 2022Updated 3 years ago
- A high throughput, end-to-end RL library for infinite-horizon tasks.☆23Oct 22, 2025Updated 5 months ago
- Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…☆65Jan 2, 2026Updated 2 months ago
- Simple CIFAR10 ResNet example with JAX.☆23Jun 1, 2021Updated 4 years ago
- Mini RL Lab☆17Jun 17, 2024Updated last year
- A number of agents (PPO, MuZero) with a Perceiver-based NN architecture that can be trained to achieve goals in nethack/minihack environm…☆43Sep 19, 2022Updated 3 years ago
- The code for experiments conducted to verify the correctness of mirror learning.☆11Jun 3, 2022Updated 3 years ago
- ☆17Feb 12, 2025Updated last year
- An implementation of the Escape Room domain for Hierarchical Reinforcement Learning.☆25May 15, 2019Updated 6 years ago
- Code for Discovered Policy Optimisation (NeurIPS 2022)☆12Jun 15, 2023Updated 2 years ago
- Simple tools to mix and match PyTorch and Jax - Get the best of both worlds!☆37Jan 8, 2026Updated 2 months ago
- Implementation of some of the Deep Distributional Reinforcement Learning Algorithms.☆26Jun 17, 2025Updated 9 months ago
- Agar.io OpenAI Gym Learning Environment☆12Sep 10, 2023Updated 2 years ago
- Risk-sensitive Inverse Reinforcement Learning☆11Sep 11, 2019Updated 6 years ago
- Deep RL agents with PyTorch☆36Sep 25, 2021Updated 4 years ago
- Standard interface for entity based reinforcement learning environments.☆38Feb 28, 2024Updated 2 years ago
- Gymnasium environment for research of UAVs and risk constraints☆12Oct 29, 2024Updated last year
- ☆18Sep 7, 2023Updated 2 years ago
- Decoupled Neural Interfaces Using Synthetic Gradients - under develeopment☆11Jun 27, 2025Updated 8 months ago
- Codebase describing experiments in Truncation Sampling as Language Model Desmoothing☆13Dec 6, 2022Updated 3 years ago
- Robust Reinforcement Learning Benchmark☆12Sep 22, 2024Updated last year
- [RAL 2025] MTIL: Encoding Full History with Mamba for Temporal Imitation Learning☆27Nov 17, 2025Updated 4 months ago
- ☆11Feb 29, 2024Updated 2 years ago
- Baseline implementation of recurrent PPO using truncated BPTT☆160Apr 28, 2024Updated last year
- Log to W&B from Julia☆12Jun 13, 2022Updated 3 years ago
- JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.☆753Oct 26, 2022Updated 3 years ago
- Scalable Monotonic Neural Networks☆12Mar 14, 2024Updated 2 years ago
- Algorithms for Policy Evaluation, Estimation of Action Values, Policy Improvement, Policy Iteration, Truncated Policy Evaluation, Truncat…☆11Apr 3, 2019Updated 6 years ago
- [ 👾 ] ➡️ 💾 ➡️ { 🎮🕹️ } Extra Stable-Baselines3 buffer classes. Reducing RL memory usage drastically with minimal overhead.☆22Dec 9, 2025Updated 3 months ago
- Code to accompany "Conformal Prediction as Bayesian Quadrature" by Jake Snell & Tom Griffiths (ICML 2025 Outstanding Paper)☆23Jul 14, 2025Updated 8 months ago
- A collection of meta-learning algorithms in Jax☆24Sep 3, 2022Updated 3 years ago
- Code for Learning to Defer to Multiple Experts: Consistent Surrogate Losses, Confidence Calibration, and Conformal Ensembles [AISTATS'23]☆13Jul 28, 2023Updated 2 years ago
- RL Environments in JAX 🌍☆871May 30, 2025Updated 9 months ago