jacklishufan/Reflect-DiT

Reflect-DiT: Inference-Time Scaling for Text-to-Image Diffusion Transformers via In-Context Reflection

☆56

Alternatives and similar repositories for Reflect-DiT

Users interested in Reflect-DiT are comparing it to the repositories listed below.

KAIST-Visual-AI-Group / Flow-Inference-Time-Scaling
[NeurIPS 2025] Official code for Inference-Time Scaling for Flow Models via Stochastic Generation and Rollover Budget Forcing
☆75 · Oct 12, 2025 · Updated 9 months ago

Tianhao-Qi / Mask2DiT
CVPR 2025 Accepted Papers
☆26 · Dec 20, 2025 · Updated 7 months ago

tinnerhrhe / EvoSearch-codes
An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search
☆107 · Oct 3, 2025 · Updated 9 months ago

Diffusion-CoT / ReflectionFlow
[ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
☆220 · Nov 5, 2025 · Updated 8 months ago

aiming-lab / MJ-Video
[NeurIPS'25 Spotlight] MJ-VIDEO: Fine-Grained Benchmarking and Rewarding Video Preferences in Video Generation
☆20 · Feb 23, 2025 · Updated last year

facebookresearch / GenEval2
Evaluation codes and data for GenEval2
☆80 · Jan 8, 2026 · Updated 6 months ago

xiefan-guo / i4vgen
[arXiv 2024] I4VGen: Image as Free Stepping Stone for Text-to-Video Generation
☆24 · Oct 6, 2024 · Updated last year

bcmi / Granular-GRPO
[CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models
☆64 · Jun 1, 2026 · Updated last month

OneIG-Bench / OneIG-Benchmark
[NeurIPS 2025 DB] OneIG-Bench is a meticulously designed comprehensive benchmark framework for fine-grained evaluation of T2I models acro…
☆120 · Feb 10, 2026 · Updated 5 months ago

GongyeLiu / StyleCrafter-SDXL
Code of StyleCrafter on SDXL
☆20 · Jun 25, 2024 · Updated 2 years ago

zai-org / VisionReward
[AAAI 2026] VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video Generation
☆422 · Mar 26, 2025 · Updated last year

rongyaofang / GoT
Official repository of "GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing"
☆317 · Sep 28, 2025 · Updated 9 months ago

KaiyueSun98 / T2I-ReasonBench
T2I-ReasonBench: Benchmarking Reasoning-Informed Text-to-Image Generation
☆37 · Sep 16, 2025 · Updated 10 months ago

ZiyiZhang27 / sdpo
[IEEE TPAMI] Code for the paper "Aligning Few-Step Diffusion Models with Dense Reward Difference Learning"
☆22 · Feb 25, 2026 · Updated 5 months ago

facebookresearch / metamorph
Code for MetaMorph Multimodal Understanding and Generation via Instruction Tuning
☆235 · Jan 22, 2026 · Updated 6 months ago

X-Omni-Team / X-Omni
Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058).
☆426 · Aug 26, 2025 · Updated 11 months ago

lzzcd001 / nabla-gfn
Official Implementation of Nabla-GFlowNet (ICLR 2025)
☆28 · May 3, 2025 · Updated last year

tang-bd / fuse-dit
[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
☆140 · May 16, 2025 · Updated last year

jacklishufan / diffusion-kto
The official implementation of Diffusion-KTO: Aligning Diffusion Models by Optimizing Human Utility
☆69 · Aug 16, 2025 · Updated 11 months ago

RockeyCoss / SPO
[CVPR 2025] Aesthetic Post-Training Diffusion Models from Generic Preferences with Step-by-step Preference Optimization
☆271 · Apr 7, 2025 · Updated last year

SHI-Labs / T2I-Copilot
T2I-Copilot: A Training-Free Multi-Agent Text-to-Image System for Enhanced Prompt Interpretation and Interactive Generation (ICCV'25)
☆57 · Oct 6, 2025 · Updated 9 months ago

ZiyuGuo99 / Image-Generation-CoT
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
☆865 · Mar 19, 2026 · Updated 4 months ago

jiawn-creator / Dynamic-DiT
☆18 · Mar 21, 2025 · Updated last year

zhentao-zou / MURE
Beyond Textual CoT: Interleaved Text-image chains with Deep Confidence Reasoning for Image Editing
☆19 · Jun 24, 2026 · Updated last month

PicoTrex / GPT-ImgEval
GPT-ImgEval: Evaluating GPT-4o’s state-of-the-art image generation capabilities
☆307 · May 3, 2025 · Updated last year

KlingAIResearch / DiffMoE
[Arxiv 2025] Official PyTorch implementation of DiffMoE, TC-DiT, EC-DiT and Dense DiT
☆175 · Oct 21, 2025 · Updated 9 months ago

sayakpaul / tt-scale-flux
Inference-time scaling of diffusion-based image and video generation models.
☆174 · Dec 17, 2025 · Updated 7 months ago

zhengdian1 / AIA
☆45 · Jan 4, 2026 · Updated 6 months ago

CodeGoat24 / LiFT
Official implementation of LiFT: Leveraging Human Feedback for Text-to-Video Model Alignment.
☆85 · May 4, 2025 · Updated last year

ali-vilab / Ranni
☆237 · Apr 10, 2024 · Updated 2 years ago

wangf3014 / Patch_Scaling
Official implementation of Scaling Laws in Patchification: An Image Is Worth 50,176 Tokens And More
☆25 · Feb 25, 2025 · Updated last year

zacharyhorvitz / Fk-Diffusion-Steering
A general framework for inference-time scaling and steering of diffusion models with arbitrary rewards.
☆226 · Jun 26, 2025 · Updated last year

yifan123 / flow_grpo
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,430 · May 7, 2026 · Updated 2 months ago

Doubiiu / MotionCanvas
[SIGGRAPH 2025] MotionCanvas: Cinematic Shot Design with Controllable Image-to-Video Generation
☆36 · Aug 5, 2025 · Updated 11 months ago

Vchitect / TACA
[ICCV25] TACA: Rethinking Cross-Modal Interaction in Multimodal Diffusion Transformers
☆42 · Jul 23, 2025 · Updated last year

jinzcdev / occupied-gpus
The program used to occupy GPUs.
☆10 · Mar 24, 2023 · Updated 3 years ago

sjz5202 / LLaVA-Reward
Official repository for LLaVA-Reward (ICCV 2025): Multimodal LLMs as Customized Reward Models for Text-to-Image Generation
☆26 · Jul 30, 2025 · Updated 11 months ago

Shuweis / ResMaster
☆63 · Jun 25, 2024 · Updated 2 years ago

facebookresearch / MMRB2
Data and sample evaluation codes for Multimodal Rewardbench 2
☆147 · Dec 20, 2025 · Updated 7 months ago