🔥 Official impl. of "DetailFlow: 1D Coarse-to-Fine Autoregressive Image Generation via Next-Detail Prediction"
☆169 · Jul 10, 2025 · Updated 10 months ago
Alternatives and similar repositories for DetailFlow
Users that are interested in DetailFlow are comparing it to the libraries listed below.
- Stable-Sim2Real: Exploring Simulation of Real-Captured 3D Data with Two-Stage Depth Diffusion (ICCV 2025 Highlight) ☆29 · Mar 15, 2026 · Updated 2 months ago
- PyTorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation" ☆429 · Jun 20, 2025 · Updated 11 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation" ☆204 · Jan 7, 2026 · Updated 4 months ago
- TPDiff: Temporal Pyramid Video Diffusion Model ☆25 · Mar 13, 2025 · Updated last year
- ☆14 · Sep 22, 2025 · Updated 7 months ago
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model ☆13 · Dec 29, 2024 · Updated last year
- [CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models ☆1,463 · Dec 16, 2025 · Updated 5 months ago
- FlexTok: Resampling Images into 1D Token Sequences of Flexible Length ☆317 · Jun 2, 2025 · Updated 11 months ago
- ☆19 · Apr 28, 2025 · Updated last year
- This repository includes the official implementation of our paper "Beyond Next-Token: Next-X Prediction for Autoregressive Visual Generat… ☆249 · Oct 12, 2025 · Updated 7 months ago
- official training and inference code of bitwise tokenizer ☆71 · May 18, 2025 · Updated last year
- M2-Reasoning: Empowering MLLMs with Unified General and Spatial Reasoning ☆47 · Jul 17, 2025 · Updated 10 months ago
- ICLR 2026-MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning ☆37 · Apr 17, 2026 · Updated last month
- Selftok: Discrete Visual Tokens of Autoregression, by Diffusion, and for Reasoning ☆236 · May 30, 2025 · Updated 11 months ago
- Official PyTorch Implementation of "Latent Denoising Makes Good Visual Tokenizers" ☆190 · Feb 24, 2026 · Updated 2 months ago
- [ICCV 2025] Code & Data for: SuperEdit - Rectifying and Facilitating Supervision for Instruction-Based Image Editing ☆164 · Jun 26, 2025 · Updated 10 months ago
- FQGAN: Factorized Visual Tokenization and Generation ☆59 · Mar 29, 2025 · Updated last year
- [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,564 · Apr 16, 2026 · Updated last month
- The official code of "Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling" ☆54 · Feb 26, 2026 · Updated 2 months ago
- [Arxiv 2025] ByteMorph: Benchmarking Instruction-Guided Image Editing with Non-Rigid Motions ☆45 · Jun 11, 2025 · Updated 11 months ago
- ☆19 · Updated this week
- ☆320 · May 29, 2025 · Updated 11 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding ☆524 · Nov 14, 2025 · Updated 6 months ago
- [CVPR 2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project ☆185 · Mar 20, 2025 · Updated last year
- UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation ☆136 · Jun 10, 2025 · Updated 11 months ago
- [ICCV 2025] Official Implementation of Contrastive Flow Matching ☆178 · Jun 25, 2025 · Updated 10 months ago
- [NeurIPS DB 2025] PartNeXt: A Next-Generation Dataset for Fine-Grained and Hierarchical 3D Part Understanding ☆99 · Nov 4, 2025 · Updated 6 months ago
- [NeurIPS 2025] Vision as a Dialect: Unifying Visual Understanding and Generation via Text-Aligned Representations ☆201 · Sep 18, 2025 · Updated 8 months ago
- (CVPR 2025) Scaling Down Text Encoders of Text-to-Image Diffusion Models ☆52 · Sep 10, 2025 · Updated 8 months ago
- Cut2Next: Generating Next Shot via In-Context Tuning ☆32 · Aug 21, 2025 · Updated 8 months ago
- This repository contains the code for the paper - "Aligning Text, Images, and 3D Structure Token-by-Token" (CVPR 2026) ☆47 · Jun 11, 2025 · Updated 11 months ago
- [ICML 2026] Orienting Latent Actions for Video World Modeling ☆98 · Apr 20, 2026 · Updated last month
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆460 · Aug 8, 2025 · Updated 9 months ago
- Official PyTorch implementation of TokenSet. ☆129 · Mar 21, 2025 · Updated last year
- [ICCV2025] LONG3R: Long Sequence Streaming 3D Reconstruction ☆43 · Jul 25, 2025 · Updated 9 months ago
- Pixel-Space Generative Models ☆313 · May 11, 2025 · Updated last year
- ☆132 · Jun 24, 2025 · Updated 10 months ago
- [CVPR 2025 GMCV] Test-Time Frequency Scaling: Instant Frequency Control for Any Diffusion Model ☆56 · May 31, 2025 · Updated 11 months ago
- [NeurIPS 2025] HermesFlow: Seamlessly Closing the Gap in Multimodal Understanding and Generation ☆78 · Sep 19, 2025 · Updated 8 months ago