Tonghe-Zhang / Awesome-Flow-RL-PapersView external linksLinks
A collection of paper/projects that trains flow matching model/policies via RL.
☆361Dec 25, 2025Updated last month
Alternatives and similar repositories for Awesome-Flow-RL-Papers
Users that are interested in Awesome-Flow-RL-Papers are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL☆1,998Nov 4, 2025Updated 3 months ago
- Implementation of Flow Policy Optimization (FPO)☆353Jan 13, 2026Updated last month
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆638Feb 10, 2026Updated last week
- ☆63Jul 10, 2025Updated 7 months ago
- Evaluation codes and data for GenEval2☆55Jan 8, 2026Updated last month
- Code and Data for "GenAI Arena: An Open Evaluation Platform for Generative Models" [NeurIPS 2024]☆34Sep 8, 2024Updated last year
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT☆430Sep 18, 2025Updated 4 months ago
- This repository summarizes recent advances in the VLA + RL paradigm and provides a taxonomic classification of relevant works.☆387Oct 10, 2025Updated 4 months ago
- Training Autoregressive Image Generation models via Reinforcement Learning☆50Nov 26, 2025Updated 2 months ago
- ☆10Nov 18, 2024Updated last year
- A Quasi-Wasserstein Loss for Learning Graph Neural Networks (QW loss)☆10May 20, 2024Updated last year
- ☆328Sep 15, 2025Updated 5 months ago
- ☆66Aug 13, 2025Updated 6 months ago
- Official implementation of HEAD CoRL 2025☆24Aug 22, 2025Updated 5 months ago
- NeurIPS 2024☆14Oct 29, 2025Updated 3 months ago
- EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling☆213Feb 3, 2026Updated 2 weeks ago
- The official repository of EffiVED☆19Jun 5, 2024Updated last year
- RLHF for Video Diffusion Models☆23Jul 30, 2025Updated 6 months ago
- Cosmos Policy☆510Jan 23, 2026Updated 3 weeks ago
- Official implementation of Pref-GRPO: Pairwise Preference Reward-based GRPO for Stable Text-to-Image Reinforcement Learning☆230Feb 10, 2026Updated last week
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆427Jun 20, 2025Updated 7 months ago
- ☆51Feb 28, 2025Updated 11 months ago
- A curated list of Diffusion Model in RL resources (continually updated)☆1,525Dec 15, 2025Updated 2 months ago
- Code Release for floq: Training Critics via Flow-Matching for Scaling Compute In Value-Based RL☆31Feb 7, 2026Updated last week
- ☆19May 20, 2025Updated 8 months ago
- OKAMI: Teaching Humanoid Robots Manipulation Skills through Single Video Imitation☆32Jun 18, 2025Updated 8 months ago
- ☆23Dec 16, 2025Updated 2 months ago
- paper list, tutorial, and nano code snippet for Diffusion Large Language Models.☆154Jan 19, 2026Updated 3 weeks ago
- ☆350Feb 5, 2026Updated last week
- The code implementation for the paper "DreamLifting: A Plug-in Module Lifting MV Diffusion Models for 3D Asset Generation".☆29Sep 1, 2025Updated 5 months ago
- [ACMMM 2025 - Dataset Track] ComplexBench-Edit: Benchmarking Complex Instruction-Driven Image Editing via Compositional Dependencies☆22Jun 20, 2025Updated 7 months ago
- Official Implementation for Quality-Similar Diversity via Population Based Reinforcement Learning☆19Dec 26, 2025Updated last month
- Native Multimodal Models are World Learners☆1,456Dec 30, 2025Updated last month
- Official implementation of Diffusion Policy Policy Optimization, arxiv 2024☆756Feb 4, 2025Updated last year
- ☆23Oct 9, 2024Updated last year
- [IEEE TPAMI] Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"☆19Feb 8, 2026Updated last week
- [NeurIPS 2025] Improving Video Generation with Human Feedback☆424Sep 24, 2025Updated 4 months ago
- ☆87Aug 4, 2025Updated 6 months ago
- [ECCV 2024] 💐Official implementation of the paper "Diffusion Reward: Learning Rewards via Conditional Video Diffusion"☆117Jul 2, 2024Updated last year