VisionChengzhuo/CoF-T2I

Video models as pure visual reasoners for high-quality text-to-image generation via Chain-of-Frame reasoning.

☆39

Alternatives and similar repositories for CoF-T2I

Users who are interested in CoF-T2I are comparing it to the repositories listed below.

Ryann-Ran / Scone
(CVPR 2026 Highlight) Official repository for Scone (Subject-driven COmposition and DistinctioN Enhancement) model, supporting subject co…
☆32 · Apr 9, 2026 · Updated 3 months ago
Zzz99999 / Video-Zero
☆19 · May 15, 2026 · Updated 2 months ago
arielshaulov / TokenTrim
Official implementation of the paper "TOKENTRIM: INFERENCE-TIME TOKEN PRUNING FOR AUTOREGRESSIVE LONG VIDEO GENERATION"
☆15 · Feb 8, 2026 · Updated 5 months ago
ictnlp / GMA
Code for ACL 2022 findings paper "Gaussian Multi-head Attention for Simultaneous Machine Translation"
☆11 · Mar 31, 2022 · Updated 4 years ago
linYDTHU / StableVelocity
[ICML 2026] Stable Velocity: A Variance Perspective on Flow Matching
☆29 · Feb 19, 2026 · Updated 5 months ago
bcmi / Granular-GRPO
[CVPR 2026] Fine-Grained GRPO for Precise Preference Alignment in Flow Models
☆64 · Jun 1, 2026 · Updated last month
byhuang123 / PoCo
[CVPR2026] Official implementation of our paper “Rethinking Position Embedding as a Context Controller for Multi-Reference and Multi-Shot…
☆19 · Apr 8, 2026 · Updated 3 months ago
dingyue772 / OmniSIFT
[ICML2026] OmniSIFT: Modality-Asymmetric Token Compression for Efficient Omni-modal Large Language Models
☆25 · May 21, 2026 · Updated 2 months ago
SAIS-FUXI / Omni-Video
☆156 · Feb 28, 2026 · Updated 4 months ago
appletea233 / EditThinker
Unlocking Iterative Reasoning for Any Image Editor
☆111 · Jan 18, 2026 · Updated 6 months ago
llm-conditioned-diffusion / OmniDiffusion
☆14 · Jul 17, 2024 · Updated 2 years ago
deepshwang / crepa
☆15 · Jun 21, 2025 · Updated last year
jiaosiyuu / ThinkGen
ThinkGen: Generalized Thinking for Visual Generation
☆60 · Dec 30, 2025 · Updated 6 months ago
Phantom-video / LibraGen
☆17 · Mar 19, 2026 · Updated 4 months ago
fudan-generative-vision / MixFlow
[CVPR 2026] MixFlow Training: Alleviating Exposure Bias with Slowed Interpolation Mixture
☆21 · Dec 23, 2025 · Updated 6 months ago
Henry-Lee-real / StableI2I
Official implementation of StableI2I (ICML 2026)
☆19 · May 11, 2026 · Updated 2 months ago
KlingAIResearch / VANS
[CVPR 2026] Video-as-Answer: Predict and Generate Next Video Event with Joint-GRPO
☆119 · Feb 28, 2026 · Updated 4 months ago
SAIS-FUXI / IPO
☆58 · May 6, 2025 · Updated last year
Matt-Su / DR-Adapter
☆22 · May 12, 2024 · Updated 2 years ago
G-U-N / UniRL
[ICML 2026] A unified reinforcement learning toolbox for joint RL on language models and diffusion models
☆91 · May 26, 2026 · Updated last month
Osilly / Interleaving-Reasoning-Generation
[ICLR 2026] This is an early exploration to introduce Interleaving Reasoning to Text-to-image Generation field and achieve the SoTA bench…
☆100 · Jan 26, 2026 · Updated 5 months ago
jha-lab / LinGen
☆30 · Jun 9, 2025 · Updated last year
Fr0zenCrane / Uni-ViGU
Uni-ViGU: Towards Unified Video Generation and Understanding via A Diffusion-Based Video Generator
☆33 · Apr 15, 2026 · Updated 3 months ago
SJTU-DENG-Lab / Think-Then-Generate
☆115 · Jul 1, 2026 · Updated 3 weeks ago
KlingAIResearch / StereoPilot
The official implementation of StereoPilot
☆116 · Dec 19, 2025 · Updated 7 months ago
yongliu20 / Awesome-Unified-Understanding-and-Generation
☆52 · Aug 22, 2025 · Updated 11 months ago
SJTU-DENG-Lab / LatentUM
☆56 · Apr 9, 2026 · Updated 3 months ago
Shredded-Pork / Flash-GRPO
[ICML 2026] Flash-GRPO: Efficient Alignment for Video Diffusion via One-Step Policy Optimization
☆58 · Jun 11, 2026 · Updated last month
Eyeline-Labs / VChain
[ACL 2026 Findings, ICCV 2025 Workshop Outstanding Paper Award] VChain: Chain-of-Visual-Thought for Reasoning in Video Generation
☆120 · Apr 8, 2026 · Updated 3 months ago
HL-hanlin / V-Co
Official implementation of V-Co: A Closer Look at Visual Representation Alignment via Co-Denoising (ECCV 2026)
☆27 · Jun 29, 2026 · Updated 3 weeks ago
Video-Reason / Awesome-Video-Reasoning
This is a collection of recent papers on reasoning in video generation models.
☆164 · Updated this week
KlingAIResearch / VMoBA
Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"
☆64 · Jul 1, 2025 · Updated last year
Bluear7878 / H2-Cache-A-Hierarchical-Dual-Stage-Cache
☆22 · Nov 3, 2025 · Updated 8 months ago
bytedance-fanqie-ai / MoGA
Mixture-of-Groups Attention for End-to-End Long Video Generation
☆99 · Oct 22, 2025 · Updated 9 months ago
AIGeeksGroup / UniVid
UniVid: The Open-Source Unified Video Model
☆32 · Oct 13, 2025 · Updated 9 months ago
ABU121111 / DreamWorld
DreamWorld: Unified World Modeling in Video Generation
☆60 · Mar 24, 2026 · Updated 3 months ago
thunderbolt215 / UniPercept
[ICML2026 Spotlight] UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture
☆155 · Jul 13, 2026 · Updated last week
xh9998 / DiffVSR
☆59 · Oct 15, 2025 · Updated 9 months ago
GeekGuru123 / ProfilingDiT
☆20 · Jan 1, 2026 · Updated 6 months ago