🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"
☆170 · Jul 10, 2025 · Updated 11 months ago
Alternatives and similar repositories for DetailFlow
Users that are interested in DetailFlow are comparing it to the libraries listed below.
- Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion (ICCV 2025 Highlight) ☆31 · Mar 15, 2026 · Updated 3 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation" ☆429 · Jun 20, 2025 · Updated last year
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆204 · Jan 7, 2026 · Updated 5 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model ☆25 · Mar 13, 2025 · Updated last year
- ☆14 · Sep 22, 2025 · Updated 9 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model ☆13 · Dec 29, 2024 · Updated last year
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ☆1,498 · Dec 16, 2025 · Updated 6 months ago
- ☆19 · Apr 28, 2025 · Updated last year
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length ☆321 · Jun 2, 2025 · Updated last year
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆251 · Oct 12, 2025 · Updated 8 months ago
- official training and inference code of bitwise tokenizer ☆72 · May 18, 2025 · Updated last year
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ☆48 · Jul 17, 2025 · Updated 11 months ago
- ICLR 2026-MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning ☆38 · Apr 17, 2026 · Updated 2 months ago
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning ☆239 · May 30, 2025 · Updated last year
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers" ☆194 · Feb 24, 2026 · Updated 4 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ☆165 · Jun 26, 2025 · Updated last year
- FQGAN: Factorized Visual Tokenization and Generation ☆59 · Mar 29, 2025 · Updated last year
- [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,576 · Apr 16, 2026 · Updated 2 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions ☆45 · Jun 11, 2025 · Updated last year
- ☆20 · Jun 2, 2026 · Updated 3 weeks ago
- ☆322 · May 29, 2025 · Updated last year
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding ☆527 · Nov 14, 2025 · Updated 7 months ago
- [CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆186 · Mar 20, 2025 · Updated last year
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation ☆136 · Jun 10, 2025 · Updated last year
- [ICCV 2025] Official Implementation of Contrastive Flow Matching ☆182 · Jun 25, 2025 · Updated last year
- [Neurips DB 2025] PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding ☆101 · Nov 4, 2025 · Updated 7 months ago
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations ☆202 · Sep 18, 2025 · Updated 9 months ago
- [NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL ☆2,369 · May 7, 2026 · Updated last month
- The official code of "Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling" ☆60 · Feb 26, 2026 · Updated 4 months ago
- (CVPR 2025) Scaling Down Text Encoders of Text-to-Image Diffusion Models ☆53 · Sep 10, 2025 · Updated 9 months ago
- Cut2Next: Generating Next Shot via In-Context Tuning ☆33 · Aug 21, 2025 · Updated 10 months ago
- This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026) ☆48 · Jun 11, 2025 · Updated last year
- [ICML 2026] Orienting Latent Actions for Video World Modeling ☆107 · Apr 20, 2026 · Updated 2 months ago
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆465 · Aug 8, 2025 · Updated 10 months ago
- Official PyTorch implementation of TokenSet. ☆129 · Mar 21, 2025 · Updated last year
- [ICCV2025] LONG3R: Long Sequence Streaming 3D Reconstruction ☆43 · Jul 25, 2025 · Updated 11 months ago
- Pixel-Space Generative Models ☆315 · May 11, 2025 · Updated last year
- ☆131 · Jun 24, 2025 · Updated last year
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation ☆78 · Sep 19, 2025 · Updated 9 months ago