zhouzypaul/wsrl

JAX implementation of WSRL and RL baselines | ICLR 2025

☆132

Alternatives and similar repositories for wsrl

Users who are interested in wsrl are comparing it to the repositories listed below.

seohongpark / horizon-reduction
The official implementation of "Horizon Reduction Makes RL Scalable"
☆181 | Aug 2, 2025 | Updated 7 months ago
Exiam6 / ViTaL
Accompanying codebase for the paper "Touch begins where vision ends: Generalizable policies for contact-rich manipulation"
☆100 | Jul 1, 2025 | Updated 8 months ago
ColinQiyangLi / qc
☆356 | Feb 5, 2026 | Updated last month
seohongpark / fql
The official implementation of flow Q-learning (FQL)
☆281 | Jul 21, 2025 | Updated 7 months ago
seohongpark / ogbench
A benchmark for offline goal-conditioned RL and offline RL
☆339 | Jan 14, 2026 | Updated last month
WEIRDLabUW / sgft
☆19 | Feb 6, 2025 | Updated last year
cccedric / conrft
The official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy"
☆329 | Nov 11, 2025 | Updated 3 months ago
RLE-Foundation / Plasticine
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning
☆35 | Feb 9, 2026 | Updated 3 weeks ago
aliang8 / varibad_jax
☆10 | Jun 27, 2024 | Updated last year
JYChen18 / TaskDexGrasp
Minimal code for "Task-Oriented Dexterous Hand Pose Synthesis Using Differentiable Grasp Wrench Boundary Estimator" (IROS 2024)
☆15 | Feb 12, 2025 | Updated last year
kvfrans / jaxtransformer
Minimal Transformer base in JAX. A single backbone for language modelling, diffusion, classification, etc.
☆14 | May 28, 2025 | Updated 9 months ago
ankile / robust-rearrangement
From Imitation to Refinement: Residual RL for Precise Assembly
☆215 | Dec 2, 2025 | Updated 3 months ago
nakamotoo / Cal-QL
Official implementation of the paper "Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning" (NeurIPS 2023)
☆120 | Jul 31, 2024 | Updated last year
kvfrans / cfgrl
☆91 | May 31, 2025 | Updated 9 months ago
tongzhoumu / policy_decorator
Code for "Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model"
☆110 | Oct 24, 2025 | Updated 4 months ago
hengyuan-hu / ibrl
☆70 | Sep 23, 2024 | Updated last year
adrialopezescoriza / demo3
Official implementation of DEMO3
☆65 | Jul 29, 2025 | Updated 7 months ago
SonyResearch / simba
☆122 | Feb 25, 2025 | Updated last year
rail-berkeley / serl
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
☆780 | Oct 27, 2025 | Updated 4 months ago
irom-princeton / dppo
Official implementation of Diffusion Policy Policy Optimization, arXiv 2024
☆764 | Feb 4, 2025 | Updated last year
younggyoseo / FastTD3
☆429 | Oct 12, 2025 | Updated 4 months ago
GuanxingLu / vlarl
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning
☆401 | Nov 8, 2025 | Updated 3 months ago
seohongpark / HILP
Foundation Policies with Hilbert Representations (ICML 2024)
☆105 | Sep 29, 2025 | Updated 5 months ago
sash-a / CleanRL.jl
Simple single-file implementations of reinforcement learning algorithms in Julia
☆23 | Feb 15, 2025 | Updated last year
uiuckimlab / CHILD
Official GitHub repository for "CHILD: a Whole-Body Humanoid Teleoperation System" (Humanoids 2025)
☆41 | Jan 30, 2026 | Updated last month
nisutte / voxel-serl
SERL: A Software Suite for Sample-Efficient Robotic Reinforcement Learning
☆27 | Aug 28, 2025 | Updated 6 months ago
MaxSobolMark / PolicyAgnosticRL
☆87 | Aug 4, 2025 | Updated 7 months ago
rail-berkeley / hil-serl
☆1,175 | Oct 27, 2025 | Updated 4 months ago
youliangtan / agentlace
Connect agent policies for distributed ML applications
☆74 | Mar 12, 2025 | Updated 11 months ago
ZhengyiLuo / Omnigrasp
Official implementation of the NeurIPS 2024 paper "Omnigrasp: Simulated Humanoid Grasping on Diverse Objects"
☆148 | Nov 17, 2025 | Updated 3 months ago
seohongpark / METRA
METRA: Scalable Unsupervised RL with Metric-Aware Abstraction (ICLR 2024)
☆87 | Oct 15, 2023 | Updated 2 years ago
nissymori / JAX-CORL
Clean single-file implementation of offline RL algorithms in JAX
☆170 | Nov 24, 2025 | Updated 3 months ago
omron-sinicx / action-constrained-RL-benchmark
☆26 | Apr 26, 2024 | Updated last year
gauthamvasan / avg
Action Value Gradient Algorithm
☆28 | May 18, 2025 | Updated 9 months ago
sNiper-Qian / pianomime
☆60 | Apr 4, 2025 | Updated 11 months ago
uiuckimlab / PAPRAS-V0-Public
☆18 | Jul 9, 2025 | Updated 7 months ago
facebookresearch / humenv
HumEnv is an SMPL humanoid environment enabling systematic model comparison and reproducibility
☆116 | Apr 22, 2025 | Updated 10 months ago
mttga / purejaxql
Simple single-file baselines for Q-learning in a pure-GPU setting
☆237 | Nov 24, 2025 | Updated 3 months ago
mazpie / genrl
[NeurIPS 2024] GenRL: Multimodal-foundation world models enable grounding language and video prompts into embodied domains, by turning th…
☆86 | Apr 4, 2025 | Updated 11 months ago