dibyaghosh/jaxrl_m

Skeleton for scalable and flexible Jax RL implementations

☆100

Alternatives and similar repositories for jaxrl_m

Users who are interested in jaxrl_m are comparing it to the libraries listed below.

dibyaghosh / icvf_release
Public code for "Reinforcement Learning from Passive Data via Latent Intentions"
☆89 · Nov 19, 2023 · Updated 2 years ago
ikostrikov / jaxrl
JAX (Flax) implementation of algorithms for Deep Reinforcement Learning with continuous action spaces.
☆757 · Oct 26, 2022 · Updated 3 years ago
facebookresearch / ExPLORe
This is code to accompany the paper "Accelerating Exploration with Unlabeled Prior Data".
☆26 · Dec 5, 2023 · Updated 2 years ago
seohongpark / HIQL
HIQL: Offline Goal-Conditioned RL with Latent States as Actions (NeurIPS 2023)
☆98 · Dec 1, 2024 · Updated last year
rail-berkeley / grif_release
Code release for the paper "Goal Representations for Instruction Following: A Semi-Supervised Language Interface to Control"
☆17 · Apr 9, 2024 · Updated 2 years ago
kvfrans / rlbase_stable
☆46 · Jul 12, 2024 · Updated 2 years ago
EdanToledo / Stoix
🏛️ A research-friendly codebase for fast experimentation of single-agent reinforcement learning in JAX • End-to-End JAX RL
☆416 · Mar 18, 2026 · Updated 4 months ago
Howuhh / streaming-drl-jax
streaming deep reinforcement learning but 4x faster with jax!
☆19 · Jan 4, 2026 · Updated 6 months ago
Asap7772 / PTR
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…
☆32 · Oct 26, 2022 · Updated 3 years ago
kenjyoung / dreamerv2_JAX
An implementation of DreamerV2 written in JAX, with support for running multiple random seeds of an experiment on a single GPU.
☆18 · Jan 16, 2023 · Updated 3 years ago
evgenii-nikishin / rl_with_resets
JAX implementation of deep RL agents with resets from the paper "The Primacy Bias in Deep Reinforcement Learning"
☆107 · May 17, 2022 · Updated 4 years ago
facebookresearch / gen_dgrl
Official codebase for "The Generalization Gap in Offline Reinforcement Learning" accepted to ICLR 2024
☆29 · Apr 8, 2026 · Updated 3 months ago
araffin / sbx
SBX: Stable Baselines Jax (SB3 + Jax) RL algorithms
☆602 · Updated this week
tinker495 / jax-baseline
Jax-Baseline is a Reinforcement Learning implementation using JAX and Flax/Haiku libraries, mirroring the functionality of Stable-Baselin…
☆67 · Updated this week
ethanluoyc / corax
Corax: Core RL in JAX
☆41 · Feb 22, 2024 · Updated 2 years ago
conglu1997 / v-d4rl
Challenges and Opportunities in Offline Reinforcement Learning from Visual Observations
☆115 · Apr 16, 2026 · Updated 3 months ago
keraJLi / rejax
Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!
☆274 · Jun 10, 2026 · Updated last month
UT-Austin-RPL / sailor
☆20 · Jun 16, 2023 · Updated 3 years ago
seohongpark / ogbench
A benchmark for offline goal-conditioned RL and offline RL
☆438 · Jan 14, 2026 · Updated 6 months ago
RobertTLange / gymnax
RL Environments in JAX 🌍
☆910 · Apr 2, 2026 · Updated 3 months ago
luchris429 / purejaxrl
Really Fast End-to-End Jax RL Implementations
☆1,092 · Sep 9, 2024 · Updated last year
penn-pal-lab / peg
Code for "Planning Goals for Exploration", ICLR2023 Spotlight. An unsupervised RL agent for hard exploration tasks.
☆83 · May 13, 2024 · Updated 2 years ago
enjeeneer / zero-shot-rl
VC-FB and MC-FB algorithms from "Zero-Shot Reinforcement Learning from Low Quality Data" (NeurIPS 2024)
☆29 · Jan 14, 2025 · Updated last year
instadeepai / flashbax
⚡ Flashbax: Accelerated Replay Buffers in JAX
☆279 · Sep 22, 2025 · Updated 9 months ago
mttga / purejaxql
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242 · Nov 24, 2025 · Updated 7 months ago
seohongpark / HILP
Foundation Policies with Hilbert Representations (ICML 2024)
☆104 · Sep 29, 2025 · Updated 9 months ago
d5rlbenchmark / d5rl
☆31 · Oct 3, 2023 · Updated 2 years ago
RLAgent / factor-world
Decomposing the Generalization Gap in Imitation Learning for Visual Robotic Manipulation (2023)
☆47 · Jul 14, 2023 · Updated 3 years ago
astanic / crafter-ood
☆19 · Nov 25, 2022 · Updated 3 years ago
ikostrikov / jaxrl2
☆58 · Jan 20, 2023 · Updated 3 years ago
RobertTLange / gymnax-blines
Baselines for gymnax 🤖
☆78 · Apr 3, 2023 · Updated 3 years ago
ben-eysenbach / mnm
Code to accompany the paper "Mismatched No More: Joint Model-Policy Optimization for Model-Based RL"
☆21 · Oct 6, 2021 · Updated 4 years ago
frt03 / jax_dt
Minimal Decision Transformer Implementation written in Jax (Flax).
☆18 · Aug 8, 2022 · Updated 3 years ago
nissymori / JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
☆182 · Jun 5, 2026 · Updated last month
jianlanluo / SAQ
☆34 · Jun 9, 2025 · Updated last year
clvrai / furniture-bench
FurnitureBench: Real-World Furniture Assembly Benchmark (RSS 2023)
☆235 · Mar 31, 2025 · Updated last year
ikostrikov / rlpd
☆409 · Feb 13, 2023 · Updated 3 years ago
haosulab / RPG
Code base for paper: Reparameterized Policy Learning for Multimodal Trajectory Optimization
☆28 · Jul 19, 2023 · Updated 3 years ago
boschresearch / ube-mbrl
Model-Based Uncertainty in Value Functions (AISTATS2023)
☆16 · Feb 28, 2023 · Updated 3 years ago