[NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
☆72Oct 12, 2025Updated 5 months ago
Alternatives and similar repositories for Flow-Inference-Time-Scaling
Users that are interested in Flow-Inference-Time-Scaling are comparing it to the libraries listed below
Sorting:
- Official implementation for Diffusion Alignment as Sampling (DAS), ICLR'25, Spotlight☆60Feb 12, 2025Updated last year
- Official implementation of SyncTweedies: A General Generative Framework Based on Synchronized Diffusions (NeurIPS 2024)☆70Aug 4, 2024Updated last year
- Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection☆56Aug 16, 2025Updated 7 months ago
- Official implementation of StochSync: a zero-shot approach for image generation in arbitrary spaces via stochastic diffusion synchronizat…☆21Jun 24, 2025Updated 8 months ago
- Official Implementation of Nabla-GFlowNet (ICLR 2025)☆28May 3, 2025Updated 10 months ago
- [NeurIPS 2025] Official code for ORIGEN: Zero-Shot 3D Orientation Grounding in Text-to-Image Generation☆33Oct 17, 2025Updated 5 months ago
- ☆13Jan 22, 2025Updated last year
- Toward Ambulatory Vision: Learning Visually-Grounded Active View Selection☆22Feb 5, 2026Updated last month
- [NeurIPS 2024] Official Implementation of GrounDiT☆59Dec 12, 2024Updated last year
- ☆14Sep 11, 2025Updated 6 months ago
- ☆23Sep 28, 2023Updated 2 years ago
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis☆131May 16, 2025Updated 10 months ago
- Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark☆28Apr 22, 2025Updated 10 months ago
- Official implementation of HPSv3: Towards Wide-Spectrum Human Preference Score (ICCV2025)☆279Dec 5, 2025Updated 3 months ago
- Official Implementation of "Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling"☆15Nov 20, 2023Updated 2 years ago
- [arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation☆24Oct 6, 2024Updated last year
- Official source codes of "TweedieMix: Improving Multi-Concept Fusion for Diffusion-based Image/Video Generation" (ICLR 2025)☆62Jan 22, 2025Updated last year
- Official implementation of Learning to Discretize Denoising Diffusion ODEs☆33May 21, 2025Updated 9 months ago
- Official Implementation of Posterior Distillation Sampling☆93Jul 7, 2025Updated 8 months ago
- Source code for "A Dense Reward View on Aligning Text-to-Image Diffusion with Preference" (ICML'24).☆40May 9, 2024Updated last year
- 🏞️ Official implementation of "Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition"☆110Nov 24, 2025Updated 3 months ago
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆101Oct 3, 2025Updated 5 months ago
- [ICCV 2025] Official pytorch implementation of "SteerX: Creating Any Camera-Free 3D and 4D Scenes with Geometric Steering"☆50Mar 20, 2025Updated last year
- Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"☆312Sep 28, 2025Updated 5 months ago
- ☆78May 8, 2025Updated 10 months ago
- Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope…☆312Mar 12, 2025Updated last year
- A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.☆217Jun 26, 2025Updated 8 months ago
- paper collection: alignment of diffusion models☆25Mar 6, 2026Updated 2 weeks ago
- Inference-time scaling of diffusion-based image and video generation models.☆172Dec 17, 2025Updated 3 months ago
- [CVPR 2024] InitNO: Boosting Text-to-Image Diffusion Models via Initial Noise Optimization☆77Jun 7, 2024Updated last year
- Official code for paper Advantage Weighted Matching: Aligning RL with Pretraining in Diffusion Models☆59Jan 16, 2026Updated 2 months ago
- Official repository for "CFG++: manifold-constrained classifier free guidance for diffusion models" (ICLR2025)☆238Mar 21, 2025Updated last year
- ☆29May 7, 2025Updated 10 months ago
- The repo for: TriHuman: A Real-time and Controllable Tri-plane Representation for Detailed Human Geometry and Appearance Synthesis☆19Nov 15, 2025Updated 4 months ago
- Official implementation for "Nested Attention: Semantic-aware Attention Values for Concept Personalization" [SIGGRAPH 2025]☆27Aug 4, 2025Updated 7 months ago
- An open source Multi-View Latent Diffusion Model☆42Feb 23, 2026Updated 3 weeks ago
- [CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization☆265Apr 7, 2025Updated 11 months ago
- Official implemention of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025)☆97Mar 12, 2025Updated last year
- ☆121Jan 13, 2025Updated last year