Matt00n/PolicyGradientsJax

Matt00n / PolicyGradientsJax

On-Policy Policy Gradient Algorithms in JAX

☆44

Alternatives and similar repositories for PolicyGradientsJax

Users who are interested in PolicyGradientsJax are comparing it to the libraries listed below.

MyNameIsArko / RL-Flax
View on GitHub
Various reinforcement learning algorithms written in Jax + Flax
☆26Jun 24, 2023Updated 3 years ago
tinker495 / jax-baseline
View on GitHub
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆67Updated this week
yun-kwak / decision-transformer-jax
View on GitHub
Decision Transformer JAX - Reproduction of 'Decision Transformer: Reinforcement Learning via Sequence Modeling' in JAX and Haiku
☆13Aug 14, 2024Updated last year
lmzintgraf / hyperx
View on GitHub
☆16Aug 2, 2022Updated 3 years ago
chscheller / minerl_agent
View on GitHub
Third-place submission to the NeurIPS MineRL competition 2019
☆10Mar 24, 2023Updated 3 years ago
RyanNavillus / reward-surfaces
View on GitHub
☆19Apr 22, 2024Updated 2 years ago
Snagnar / Hieros
View on GitHub
Implementation of the HIERarchical imagination On Structured State Space Sequence Models (HIEROS) paper
☆23Jul 14, 2024Updated 2 years ago
zouwj16 / MUPO
View on GitHub
Code for Policy Bifurcation in Safe Reinforcement Learning
☆10Jul 4, 2025Updated last year
typoverflow / flow-rl
View on GitHub
Flow RL is a high-performance RL library with flow and diffusion models.
☆42Jun 16, 2026Updated last month
Toshihiro-Ota / decision-mamba
View on GitHub
Decision Mamba: Reinforcement Learning via Sequence Modeling with Selective State Spaces
☆53Apr 1, 2024Updated 2 years ago
marcharper / pyed
View on GitHub
Computes trajectories for evolutionary dynamics.
☆15Oct 6, 2020Updated 5 years ago
eager-dev / eager
View on GitHub
[deprecated] Engine Agnostic Gym Environment for Robotics
☆17Feb 10, 2022Updated 4 years ago
FLAIROx / popjym
View on GitHub
POPGym Library in JAX
☆14Apr 15, 2024Updated 2 years ago
J-zin / energy-discrepancy
View on GitHub
NeurIPS'23: Energy Discrepancies: A Score-Independent Loss for Energy-Based Models
☆18Oct 22, 2024Updated last year
luchris429 / popjaxrl
View on GitHub
Benchmarking RL for POMDPs in Pure JAX [Code for "Structured State Space Models for In-Context Reinforcement Learning" (NeurIPS 2023)]
☆116Dec 5, 2023Updated 2 years ago
mansicer / Q-Adapter
View on GitHub
Implementation of ICLR 2025 paper "Q-Adapter: Customizing Pre-trained LLMs to New Preferences with Forgetting Mitigation"
☆18Oct 5, 2024Updated last year
ikostrikov / dmcgym
View on GitHub
☆23Aug 19, 2022Updated 3 years ago
argmax-ai / aime
View on GitHub
Official repository for our paper on "Action Inference by Maximising Evidence: Zero-Shot Imitation from Observation with World Models"
☆13Dec 4, 2023Updated 2 years ago
leowyy / mcmc-importance-sampling
View on GitHub
Markov Chain Monte Carlo (MCMC) and importance sampling in the context of Bayesian linear regression
☆11Feb 25, 2018Updated 8 years ago
badass-techie / These-People-Do-Not-Exist
View on GitHub
AI that generates human faces which have never been seen before. The future is now 😁
☆17Jan 6, 2022Updated 4 years ago
rkcosner / cyberpod_sim_ros
View on GitHub
Segway Simulation Environment
☆11Dec 31, 2020Updated 5 years ago
alec-tschantz / planet
View on GitHub
PlaNet: Learning Latent Dynamics for Planning from Pixels
☆10Feb 13, 2020Updated 6 years ago
kvfrans / jaxtransformer
View on GitHub
Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc...
☆16May 28, 2025Updated last year
keraJLi / synthetic-gymnax
View on GitHub
Drop-in environment replacements that make your RL algorithm train faster.
☆22Jun 19, 2024Updated 2 years ago
HiddenBeginner / Deep-Reinforcement-Learnings
View on GitHub
Deep reinforcement learning book: https://hiddenbeginner.github.io/Deep-Reinforcement-Learnings
☆11May 10, 2024Updated 2 years ago
FLAIROx / cultural-accumulation
View on GitHub
☆16Jul 16, 2024Updated 2 years ago
zkysfls / 2024-sbdd-benchmark
View on GitHub
☆13Jan 25, 2026Updated 6 months ago
TheUnsolvedDev / CUDA_NN_FS
View on GitHub
This repository features a from-scratch implementation of a neural network using CUDA and C. The primary goal of this project is to lever…
☆12Mar 20, 2025Updated last year
Tom271 / LangevinMC
View on GitHub
MIGSAA Project 2 - Langevin Monte Carlo Algorithms
☆15Jul 25, 2023Updated 3 years ago
tomsilver / camps
View on GitHub
Code for
☆15Oct 16, 2020Updated 5 years ago
mila-iqia / Design-Editing-for-Offline-MBO
View on GitHub
[TMLR 2025 & ICLR 2025 DeLTa] Official Implementation of Design Editing for Offline Model-based Optimization 🧬 🤖
☆10Apr 17, 2025Updated last year
zmsn-2077 / CUP-safe-rl
View on GitHub
NeurIPS2022: Constrained Update Projection Approach to Safe Policy Optimization
☆13Apr 10, 2023Updated 3 years ago
ALRhub / MTS3
View on GitHub
Implementation of NeurIPS 2023 Paper "Multi Time Scale World Models"
☆18Nov 8, 2024Updated last year
biomedia-mira / demystifying-diffusion
View on GitHub
☆20Jan 15, 2024Updated 2 years ago
lry-bupt / Visual_MARL
View on GitHub
Visualization of a multi-agent reinforcement learning (MARL) strategy with efficient exploration.
☆20Oct 28, 2022Updated 3 years ago
lxcnju / sampling
View on GitHub
Some methods for sampling data points from a given distribution.
☆17Jul 16, 2018Updated 8 years ago
facebookresearch / controllable_agent
View on GitHub
The Controllable Agent project trains RL Agents able to optimize any reward function specified in real time, without any further learning…
☆80Jul 17, 2023Updated 3 years ago
NagisaZj / MetaCURE-Public
View on GitHub
☆15Apr 5, 2023Updated 3 years ago
ejmejm / discrete-representations-for-continual-rl
View on GitHub
Code for the paper "Harnessing Discrete Representations for Continual Reinforcement Learning"
☆16Jun 16, 2024Updated 2 years ago