🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"
★170 · Jul 10, 2025 · Updated 10 months ago
Alternatives and similar repositories for DetailFlow
Users that are interested in DetailFlow are comparing it to the libraries listed below.
- Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion (ICCV 2025 Highlight) ★29 · Mar 15, 2026 · Updated 2 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation" ★429 · Jun 20, 2025 · Updated 11 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ★205 · Jan 7, 2026 · Updated 5 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model ★25 · Mar 13, 2025 · Updated last year
- ★14 · Sep 22, 2025 · Updated 8 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model ★13 · Dec 29, 2024 · Updated last year
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ★1,487 · Dec 16, 2025 · Updated 5 months ago
- ★19 · Apr 28, 2025 · Updated last year
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ★250 · Oct 12, 2025 · Updated 7 months ago
- official training and inference code of bitwise tokenizer ★71 · May 18, 2025 · Updated last year
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ★48 · Jul 17, 2025 · Updated 10 months ago
- ICLR 2026-MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning ★38 · Apr 17, 2026 · Updated last month
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning ★236 · May 30, 2025 · Updated last year
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers" ★193 · Feb 24, 2026 · Updated 3 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ★165 · Jun 26, 2025 · Updated 11 months ago
- FQGAN: Factorized Visual Tokenization and Generation ★59 · Mar 29, 2025 · Updated last year
- [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ★1,570 · Apr 16, 2026 · Updated last month
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions ★45 · Jun 11, 2025 · Updated 11 months ago
- ★19 · Jun 2, 2026 · Updated last week
- ★321 · May 29, 2025 · Updated last year
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding ★525 · Nov 14, 2025 · Updated 6 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ★186 · Mar 20, 2025 · Updated last year
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation ★136 · Jun 10, 2025 · Updated 11 months ago
- [ICCV 2025] Official Implementation of Contrastive Flow Matching ★180 · Jun 25, 2025 · Updated 11 months ago
- [Neurips DB 2025] PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding ★100 · Nov 4, 2025 · Updated 7 months ago
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL ★2,325 · May 7, 2026 · Updated last month
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations ★201 · Sep 18, 2025 · Updated 8 months ago
- The official code of "Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling" ★57 · Feb 26, 2026 · Updated 3 months ago
- (CVPR 2025) Scaling Down Text Encoders of Text-to-Image Diffusion Models ★52 · Sep 10, 2025 · Updated 8 months ago
- Cut2Next: Generating Next Shot via In-Context Tuning ★33 · Aug 21, 2025 · Updated 9 months ago
- This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026) ★48 · Jun 11, 2025 · Updated 11 months ago
- [ICML 2026] Orienting Latent Actions for Video World Modeling ★105 · Apr 20, 2026 · Updated last month
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ★464 · Aug 8, 2025 · Updated 10 months ago
- Official PyTorch implementation of TokenSet. ★129 · Mar 21, 2025 · Updated last year
- [ICCV2025] LONG3R: Long Sequence Streaming 3D Reconstruction ★43 · Jul 25, 2025 · Updated 10 months ago
- Pixel-Space Generative Models ★315 · May 11, 2025 · Updated last year
- ★130 · Jun 24, 2025 · Updated 11 months ago
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation ★78 · Sep 19, 2025 · Updated 8 months ago
- [CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model ★55 · May 31, 2025 · Updated last year