CMU-AIRe/floq

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/CMU-AIRe/floq)

CMU-AIRe / floq

Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL

☆46

Alternatives and similar repositories for floq

Users that are interested in floq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

pd-perry / TQL
View on GitHub
☆28May 11, 2026Updated 2 months ago
alexanderswerdlow / faster
View on GitHub
☆29Jun 30, 2026Updated 3 weeks ago
pd-perry / EXPO
View on GitHub
☆34Aug 25, 2025Updated 10 months ago
chongyi-zheng / value-flows
View on GitHub
The official implementation of Value Flows
☆55Feb 27, 2026Updated 4 months ago
rai-opensource / q2rl
View on GitHub
Q-Estimation and Q-Gating from BC for RL
☆45Jul 8, 2026Updated 2 weeks ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
seohongpark / horizon-reduction
View on GitHub
The official implementation of "Horizon Reduction Makes RL Scalable"
☆200Aug 2, 2025Updated 11 months ago
ColinQiyangLi / dqc
View on GitHub
Decoupled Q-Chunking
☆73May 3, 2026Updated 2 months ago
naumix / BiggerRegularizedCategorical
View on GitHub
☆17Apr 23, 2026Updated 3 months ago
MaxSobolMark / PolicyAgnosticRL
View on GitHub
☆92Aug 4, 2025Updated 11 months ago
WJ2003B / mqe-release
View on GitHub
Official Release of Multistep Quasimetric Estimation (MQE)
☆18Mar 13, 2026Updated 4 months ago
gauthamvasan / avg
View on GitHub
Action Value Gradient Algorithm
☆28May 18, 2025Updated last year
naumix / BiggerRegularizedOptimistic
View on GitHub
Official implementation of the BRO algorithm
☆61Jan 29, 2025Updated last year
seohongpark / ogbench
View on GitHub
A benchmark for offline goal-conditioned RL and offline RL
☆441Jan 14, 2026Updated 6 months ago
seohongpark / fql
View on GitHub
The official implementation of flow Q-learning (FQL)
☆321Jul 21, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Guozheng-Ma / Adaptive-Replay-Ratio
View on GitHub
[ICLR 2024] Adaptive Replay Ratio implementation from 'Revisiting Plasticity in Visual RL: Data, Modules and Training Stages'.
☆13Oct 9, 2024Updated last year
nakamotoo / Cal-QL
View on GitHub
official implementation for our paper Cal-QL: Calibrated Offline RL Pre-Training for Efficient Online Fine-Tuning (NeurIPS 2023)
☆123Jul 31, 2024Updated last year
amazon-far / residual-offpolicy-rl
View on GitHub
☆143Dec 2, 2025Updated 7 months ago
roger-creus / stable-deep-rl-at-scale
View on GitHub
Code for the paper "Stable Gradients for Stable Learning at Scale in Deep Reinforcement Learning". Great performance in many environments…
☆39Oct 24, 2025Updated 9 months ago
RLE-Foundation / Plasticine
View on GitHub
Accelerating Research in Plasticity-Motivated Deep Reinforcement Learning.
☆44Feb 9, 2026Updated 5 months ago
yardenas / panda-rl-kit
View on GitHub
Deploy RL on your Real-World Franka Emika Panda
☆15Feb 22, 2026Updated 5 months ago
nico-bohlinger / RL-X
View on GitHub
A framework for Reinforcement Learning research.
☆268Updated this week
ColinQiyangLi / qc
View on GitHub
☆395Feb 5, 2026Updated 5 months ago
deepindermann / dual-goal-representations
View on GitHub
The official implementation of "Dual Goal Representations"
☆39Oct 7, 2025Updated 9 months ago
Simple, predictable pricing with DigitalOcean hosting • Ad
Always know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
ajwagen / dsrl
View on GitHub
Official implementation for DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
☆209Aug 5, 2025Updated 11 months ago
ColinQiyangLi / qam
View on GitHub
Q-learning with Adjoint Matching
☆109May 11, 2026Updated 2 months ago
zhouzypaul / wsrl
View on GitHub
JAX implementation of WSRL and RL baselines | ICLR 2025
☆145Feb 26, 2026Updated 4 months ago
mttga / purejaxql
View on GitHub
Simple single-file baselines for Q-Learning in pure-GPU setting
☆242Nov 24, 2025Updated 8 months ago
Elessar123 / SAC-FLOW
View on GitHub
☆66Dec 2, 2025Updated 7 months ago
nakamotoo / dsrl_pi0
View on GitHub
Official implementation for pi0 steering via DSRL, Steering Your Diffusion Policy with Latent Space Reinforcement Learning (CoRL 2025)
☆282Apr 27, 2026Updated 2 months ago
kylestach / dinov2-jax
View on GitHub
Reimplementation of facebook's DinoV2 in JAX. Inference (with pretrained weights) only; training is unsupported.
☆13Jun 25, 2024Updated 2 years ago
Viraj-Joshi / MTBench
View on GitHub
☆45Jul 1, 2026Updated 3 weeks ago
cvoelcker / reppo
View on GitHub
Official Code for "Relative Entropy Pathwise Policy Optimization"
☆59May 6, 2026Updated 2 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
nicklashansen / newt
View on GitHub
Official code repository for the paper "Learning Massively Multitask World Models for Continuous Control".
☆129Jan 9, 2026Updated 6 months ago
rewind-reward / ReWiND
View on GitHub
☆75Jan 29, 2026Updated 5 months ago
aoberai / trl
View on GitHub
Code for "Transitive RL: Value Learning via Divide and Conquer"
☆60Oct 31, 2025Updated 8 months ago
ALRhub / DIME
View on GitHub
☆36Aug 26, 2025Updated 10 months ago
nakamotoo / V-GPS
View on GitHub
official implementation for our paper Steering Your Generalists: Improving Robotic Foundation Models via Value Guidance (CoRL 2024)
☆55Apr 28, 2025Updated last year
typoverflow / flow-rl
View on GitHub
Flow RL is a high-performance RL library with flow and diffusion models.
☆42Jun 16, 2026Updated last month
younggyoseo / FastTD3
View on GitHub
☆456May 16, 2026Updated 2 months ago