MaxSobolMark/PolicyAgnosticRL

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/MaxSobolMark/PolicyAgnosticRL)

MaxSobolMark / PolicyAgnosticRL

☆92

Alternatives and similar repositories for PolicyAgnosticRL

Users that are interested in PolicyAgnosticRL are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zhouzypaul / wsrl
View on GitHub
JAX implementation of WSRL and RL baselines | ICLR 2025
☆145Feb 26, 2026Updated 4 months ago
tongzhoumu / policy_decorator
View on GitHub
Code for "Policy Decorator: Model-Agnostic Online Refinement for Large Policy Model"
☆117Oct 24, 2025Updated 9 months ago
nakamotoo / dsrl_pi0
View on GitHub
Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
☆281Apr 27, 2026Updated 2 months ago
ColinQiyangLi / qc
View on GitHub
☆395Feb 5, 2026Updated 5 months ago
pd-perry / TQL
View on GitHub
☆28May 11, 2026Updated 2 months ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
CMU-AIRe / floq
View on GitHub
Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL
☆46Apr 7, 2026Updated 3 months ago
nakamotoo / V-GPS
View on GitHub
official implementation for our paper Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance (CoRL 2024)
☆55Apr 28, 2025Updated last year
irom-princeton / dppo
View on GitHub
Official implementation of Diffusion Policy Policy Optimization, arxiv 2024
☆841Feb 4, 2025Updated last year
alexanderswerdlow / faster
View on GitHub
☆30Jun 30, 2026Updated 3 weeks ago
ankile / robust-rearrangement
View on GitHub
From Imitation to Refinement -- Residual RL for Precise Assembly
☆246Dec 2, 2025Updated 7 months ago
ajwagen / dsrl
View on GitHub
Official implementation for DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
☆208Aug 5, 2025Updated 11 months ago
cccedric / conrft
View on GitHub
This is the official implementation of the paper "ConRFT: A Reinforced Fine-tuning Method for VLA Models via Consistency Policy".
☆360Mar 30, 2026Updated 3 months ago
rai-opensource / q2rl
View on GitHub
Q-Estimation and Q-Gating from BC for RL
☆45Jul 8, 2026Updated 2 weeks ago
kwanyoungpark / MAC
View on GitHub
Code for Scalable Offline Model-Based RL with Action chunking
☆30Feb 20, 2026Updated 5 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
seohongpark / horizon-reduction
View on GitHub
The official implementation of "Horizon Reduction Makes RL Scalable"
☆200Aug 2, 2025Updated 11 months ago
GuanxingLu / vlarl
View on GitHub
Single-file implementation to advance vision-language-action (VLA) models with reinforcement learning.
☆446Nov 8, 2025Updated 8 months ago
seohongpark / fql
View on GitHub
The official implementation of flow Q-learning (FQL)
☆321Jul 21, 2025Updated last year
nakamotoo / Cal-QL
View on GitHub
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆124Jul 31, 2024Updated last year
ColinQiyangLi / dqc
View on GitHub
Decoupled Q-Chunking
☆73May 3, 2026Updated 2 months ago
ikostrikov / rlpd
View on GitHub
☆409Feb 13, 2023Updated 3 years ago
gen-robot / RL4VLA
View on GitHub
☆277Aug 25, 2025Updated 11 months ago
aiming-lab / GRAPE
View on GitHub
GRAPE: Guided-Reinforced Vision-Language-Action Preference Optimization
☆160Apr 6, 2025Updated last year
liruiw / HMA
View on GitHub
Learning Real-World Action-Video Dynamics with Heterogeneous Masked Autoregression
☆41Feb 17, 2025Updated last year
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
pd-perry / EXPO
View on GitHub
☆34Aug 25, 2025Updated 11 months ago
amazon-far / residual-offpolicy-rl
View on GitHub
☆143Dec 2, 2025Updated 7 months ago
yufeiwang63 / RL-VLM-F
View on GitHub
Code for Reinforcement Learning from Vision Language Foundation Model Feedback
☆140May 22, 2024Updated 2 years ago
InternRobotics / VLAC
View on GitHub
VLAC: A Vision-Language-Action-Critic Model for Robotic Real-World Reinforcement Learning
☆320Jul 13, 2026Updated last week
younggyoseo / CQN
View on GitHub
Coarse-to-fine Q-Network
☆59Aug 6, 2024Updated last year
Aaditya-Prasad / consistency-policy
View on GitHub
[RSS 2024] Consistency Policy: Accelerated Visuomotor Policies via Consistency Distillation
☆203Jul 20, 2024Updated 2 years ago
microsoft / BST
View on GitHub
☆17May 9, 2025Updated last year
rewind-reward / ReWiND
View on GitHub
☆75Jan 29, 2026Updated 5 months ago
seohongpark / ogbench
View on GitHub
A benchmark for offline goal-conditioned RL and offline RL
☆442Jan 14, 2026Updated 6 months ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
hengyuan-hu / ibrl
View on GitHub
☆74Sep 23, 2024Updated last year
ReinFlow / ReinFlow
View on GitHub
[NeurIPS 2025] Flow x RL. "ReinFlow: Fine-tuning Flow Policy with Online Reinforcement Learning". Support VLAs e.g., Pi0, Pi0.5, GR00TN1.…
☆348Apr 24, 2026Updated 3 months ago
penn-pal-lab / LIV
View on GitHub
Official repository for "LIV: Language-Image Representations and Rewards for Robotic Control" (ICML 2023)
☆135Oct 19, 2023Updated 2 years ago
Asap7772 / PTR
View on GitHub
This repository contains the implementation of the PTR algorithm described in the paper: Pre-Training for Robots: Leveraging Diverse Mult…
☆32Oct 26, 2022Updated 3 years ago
JiahengHu / FLaRe
View on GitHub
[ICRA 25] FLaRe: Achieving Masterful and Adaptive Robot Policies with Large-Scale Reinforcement Learning Fine-Tuning
☆49Jan 5, 2025Updated last year
PRIME-RL / SimpleVLA-RL
View on GitHub
[ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
☆1,794Jan 6, 2026Updated 6 months ago
Ariostgx / ript-vla
View on GitHub
Interactive Post-Training for Vision-Language-Action Models
☆168Jun 4, 2025Updated last year