A unified framework for easy reinforcement learning in Flow-Matching models
☆266Mar 20, 2026Updated this week
Alternatives and similar repositories for Flow-Factory
Users that are interested in Flow-Factory are comparing it to the libraries listed below
Sorting:
- Reinforcement Learning Framework for Visual Generation☆87Feb 13, 2026Updated last month
- Cut2Next: Generating Next Shot via In-Context Tuning☆31Aug 21, 2025Updated 6 months ago
- [ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process☆718Feb 10, 2026Updated last month
- Official Repo for "Why Settle for One? Text-to-ImageSet Generation and Evaluation"☆21Oct 1, 2025Updated 5 months ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling*☆32Dec 13, 2025Updated 3 months ago
- ☆69Aug 13, 2025Updated 7 months ago
- A survey for visual generation alignment☆126Nov 9, 2025Updated 4 months ago
- 🚀 [ICLR 2026] SenseFlow: Scaling Distribution Matching for Flow-based Text-to-Image Distillation☆72Updated this week
- PyTorch code and model checkpoints for Score identity Distillation (SiD) and its adversarial version (SiDA)☆151Mar 29, 2025Updated 11 months ago
- ☆63Jul 11, 2025Updated 8 months ago
- The official code of "Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling"☆49Feb 26, 2026Updated 3 weeks ago
- UniWorld: High-Resolution Semantic Encoders for Unified Visual Understanding and Generation☆859Dec 23, 2025Updated 2 months ago
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL☆2,073Nov 4, 2025Updated 4 months ago
- [CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models☆51Feb 21, 2026Updated 3 weeks ago
- [ICCV'25] When Large Vision-Language Model Meets Large Remote Sensing Imagery: Coarse-to-Fine Text-Guided Token Pruning☆49Feb 16, 2026Updated last month
- “SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity” by Peihao Wang, Zhiwen Fan, Dejia Xu, Dilin Wang,…☆35Jan 5, 2024Updated 2 years ago
- paper collection: alignment of diffusion models☆25Mar 6, 2026Updated 2 weeks ago
- [CVPR 2025] ChatGen: Automatic Text-to-Image Generation From FreeStyle Chatting☆33Dec 5, 2024Updated last year
- a tools to process human 3D model, such as model visualization, model fitting, etc.☆14Mar 14, 2022Updated 4 years ago
- RLHF for Video Diffusion Models☆26Jul 30, 2025Updated 7 months ago
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆28May 3, 2025Updated 10 months ago
- Official implementation of UnifiedReward & [NeurIPS 2025] UnifiedReward-Think & UnifiedReward-Flex☆744Updated this week
- [NeurIPS 2025] PyTorch Implementation of "LEDiT: Your Length-Extrapolatable Diffusion Transformer without Positional Encoding"☆25Oct 27, 2025Updated 4 months ago
- Toolbox for GTA-Human Datasets☆25Oct 9, 2024Updated last year
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆131May 16, 2025Updated 10 months ago
- This is the official repository for the paper "FLUX-Reason-6M & PRISM-Bench: A Million-Scale Text-to-Image Reasoning Dataset and Comprehe…☆128Jan 29, 2026Updated last month
- ☆41Oct 29, 2025Updated 4 months ago
- [NeurIPS 2025] T2I-R1: Reinforcing Image Generation with Collaborative Semantic-level and Token-level CoT☆432Sep 18, 2025Updated 6 months ago
- ☆28Apr 8, 2025Updated 11 months ago
- Official codes for the paper "GARDO: Reinforcing Diffusion Models without Reward Hacking"☆56Feb 2, 2026Updated last month
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆426Jun 20, 2025Updated 9 months ago
- [CVPR2025] Extrapolating and Decoupling Image-to-Video Generation Models: Motion Modeling is Easier Than You Think☆23Jul 1, 2025Updated 8 months ago
- EditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]☆130Feb 6, 2026Updated last month
- TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes☆94Nov 26, 2025Updated 3 months ago
- [ICLR 2026] JanusCoder: Towards a Foundational Visual-Programmatic Interface for Code Intelligence☆77Feb 9, 2026Updated last month
- PICABench: How Far Are We from Physically Realistic Image Editing?☆36Nov 5, 2025Updated 4 months ago
- The official github repo for "Training Optimal Large Diffusion Language Models", the first-ever large-scale diffusion language models sca…☆45Nov 6, 2025Updated 4 months ago
- A collection of diffusion models based on FLUX/DiT for image/video generation, editing, reconstruction, inpainting .etc.☆86Jun 20, 2025Updated 9 months ago
- Official Implementation of Paper Transfer between Modalities with MetaQueries☆310Oct 12, 2025Updated 5 months ago