ByteFlow-AI / DetailFlowLinks

🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"

☆111

Alternatives and similar repositories for DetailFlow

Users that are interested in DetailFlow are comparing it to the libraries listed below

Sorting:

zhaoshitian / LeX-Art
Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"
☆62Updated 2 months ago
mycfhs / DreamMix
The official implementation of paper: DreamMix: Decoupling Object Attributes for Enhanced Editability in Customized Image Inpainting
☆120Updated 5 months ago
Yaofang-Liu / Pusa-VidGen
Pusa: Thousands Timesteps Video Diffusion Model
☆199Updated this week
YujiaHu1109 / IEAP
IEAP: Image Editing As Programs with Diffusion Models
☆93Updated 3 weeks ago
alexanderswerdlow / unidisc
UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, a…
☆107Updated 2 months ago
ali-vilab / FreeScale
Code for FreeScale, a tuning-free method for higher-resolution visual generation
☆126Updated 3 months ago
YuqingWang1029 / PAR
[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project
☆164Updated 3 months ago
Gengzigang / TokenSet
Official PyTorch implementation of TokenSet.
☆121Updated 3 months ago
TIGER-AI-Lab / OmniEdit
Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025]
☆121Updated 4 months ago
chenllliang / DreamEngine
Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!
☆113Updated 3 months ago
sayakpaul / tt-scale-flux
Inference-time scaling of diffusion-based image and video generation models.
☆151Updated 3 months ago
illume-unified-mllm / ILLUME_plus
☆105Updated last week
Gen-Verse / Diffusion-Sharpening
Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening
☆62Updated last month
NJU-PCALab / TextCrafter
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes
☆66Updated 2 months ago
Kmcode1 / SG-I2V
This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation.
☆109Updated 7 months ago
YBYBZhang / VideoElevator
[AAAI 2025] Official pytorch implementation of "VideoElevator: Elevating Video Generation Quality with Versatile Text-to-Image Diffusion …
☆158Updated last year
tang-bd / fuse-dit
[CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis
☆109Updated last month
ShoufaChen / PixelFlow
Pixel-Space Generative Models
☆250Updated last month
dvlab-research / Jenga
Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving
☆203Updated 2 weeks ago
lseancs / GenerativePhotomontage
☆84Updated 10 months ago
weijiawu / ParaDiffusion
[IJCV 2025] Paragraph-to-Image Generation with Information-Enriched Diffusion Model
☆104Updated 3 months ago
Correr-Zhou / MagicTailor
[IJCAI 2025] Offical implementation of the paper "MagicTailor: Component-Controllable Personalization in Text-to-Image Diffusion Models"…
☆87Updated last month
EnVision-Research / OmniBooth
☆131Updated 3 months ago
wz0919 / DreamRunner
Official implementation of DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation
☆70Updated 2 weeks ago
desaixie / pa_vdm
CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151
☆72Updated last month
microsoft / Reducio-VAE
☆196Updated 4 months ago
TencentARC / FluxKits
☆90Updated 6 months ago
Vchitect / RepVideo
The official implementation of ”RepVideo: Rethinking Cross-Layer Representation for Video Generation“
☆117Updated 5 months ago
Dawn-LX / CausalCache-VDM
Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" …
☆62Updated last month
XuweiyiChen / UniCtrl
Official implementation of UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models via Training-Free Unified …
☆69Updated 6 months ago