ByteVisionLab / DetailFlow
🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"
☆151 · Updated last month
Alternatives and similar repositories for DetailFlow
Users who are interested in DetailFlow are comparing it to the repositories listed below.
- [CVPR 2025] Exploring the Deep Fusion of Large Language Models and Diffusion Transformers for Text-to-Image Synthesis ☆118 · Updated 3 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆172 · Updated 5 months ago
- ☆204 · Updated 6 months ago
- I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models ☆179 · Updated 6 months ago
- Official Implementation: Training-Free Efficient Video Generation via Dynamic Token Carving ☆232 · Updated 3 weeks ago
- Diffusion-Sharpening: Fine-tuning Diffusion Models with Denoising Trajectory Sharpening ☆64 · Updated 3 months ago
- Pixel-Space Generative Models ☆268 · Updated 3 months ago
- GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset ☆223 · Updated 2 weeks ago
- the official repo for "D-AR: Diffusion via Autoregressive Models" ☆111 · Updated 2 months ago
- Lumos Project: Frontier generative model research by Alibaba DAMO Academy, including Lumos-1, etc. ☆127 · Updated last month
- ☆118 · Updated last week
- ☆98 · Updated 2 weeks ago
- Inference-time scaling of diffusion-based image and video generation models. ☆167 · Updated 2 months ago
- [ICCV 2025] Code for FreeScale, a tuning-free method for higher-resolution visual generation ☆134 · Updated this week
- This is the official implementation of SG-I2V: Self-Guided Trajectory Control in Image-to-Video Generation. ☆112 · Updated 9 months ago
- The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation" ☆53 · Updated 4 months ago
- CVPRW 2025 paper Progressive Autoregressive Video Diffusion Models: https://arxiv.org/abs/2410.08151 ☆82 · Updated 3 months ago
- Official PyTorch Implementation for Dual-Process Image Generation, ICCV 2025 ☆86 · Updated this week
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think! ☆118 · Updated 5 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆178 · Updated 2 months ago
- Official Repo for Paper "OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision" [ICLR2025] ☆126 · Updated 7 months ago
- Official implementation of our paper: "Ca2-VDM: Efficient Autoregressive Video Diffusion Model with Causal Generation and Cache Sharing" … ☆67 · Updated 3 months ago
- Official Implementation of weights2weights ☆147 · Updated 5 months ago
- IEAP: Image Editing As Programs with Diffusion Models ☆98 · Updated last month
- Subjects200K dataset ☆117 · Updated 7 months ago
- Vico: Compositional Video Generation as Flow Equalization ☆58 · Updated 9 months ago
- ☆128 · Updated 2 months ago
- Official inference code and LongText-Bench benchmark for our paper X-Omni (https://arxiv.org/pdf/2507.22058). ☆356 · Updated last week
- Official implementation of "Make It Count: Text-to-Image Generation with an Accurate Number of Objects" (CVPR 2025) ☆83 · Updated 5 months ago
- Official PyTorch implementation for ICLR2024 paper "The Blessing of Randomness: SDE Beats ODE in General Diffusion-based Image Editing" ☆111 · Updated last year