UltraFlux: Data-Model Co-Design for High-quality Native 4K Text-to-Image Generation across Diverse Aspect Ratios
☆117Dec 17, 2025Updated 2 months ago
Alternatives and similar repositories for UltraFlux
Users that are interested in UltraFlux are comparing it to the libraries listed below
Sorting:
- Implementation of "VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space" - ECCV 2024☆13Mar 24, 2025Updated 11 months ago
- RePlan: Reasoning-Guided Region Planning for Complex Instruction-Based Image Editing☆58Dec 26, 2025Updated 2 months ago
- [NeurIPS 2025 Spotlight] DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling☆62Feb 12, 2026Updated 2 weeks ago
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- Implementation of the Mesh-VQVAE of "VQ-HPS: Human Pose and Shape Estimation in a Vector-Quantized Latent Space" - ECCV 2024☆17Oct 30, 2024Updated last year
- ☆34Mar 18, 2025Updated 11 months ago
- Official pytorch implementation of "AlphaFlow: Understanding and Improving MeanFlow Models"☆98Oct 24, 2025Updated 4 months ago
- [ICLR 2026] PixNerd: Pixel Neural Field Diffusion☆170Dec 10, 2025Updated 2 months ago
- ☆21Dec 14, 2025Updated 2 months ago
- FIBO-Edit brings the power of structured prompt generation to image editing☆27Jan 29, 2026Updated last month
- ☆35Dec 16, 2025Updated 2 months ago
- Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026)☆53Updated this week
- [ICLR 2026] pi-Flow: Policy-Based Few-Step Generation via Imitation Distillation☆268Updated this week
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance☆85Sep 18, 2025Updated 5 months ago
- ☆63Jul 11, 2025Updated 7 months ago
- Official code for VINCIE: Unlocking In-context Image Editing from Video☆48Sep 8, 2025Updated 5 months ago
- ☆43Dec 1, 2025Updated 2 months ago
- The official implementation of COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence.☆28Dec 30, 2025Updated 2 months ago
- ☆43Sep 1, 2025Updated 6 months ago
- Pose Extraction & Rendering for SCAIL: Towards Studio-Grade Character Animation via In-Context Learning of 3D-Consistent Pose Representat…☆180Dec 28, 2025Updated 2 months ago
- JoVA: Unified Multimodal Learning for Joint Video-Audio Generation☆30Dec 22, 2025Updated 2 months ago
- This is the official implementation of our paper: “VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame In…☆14Dec 5, 2025Updated 2 months ago
- Use claude code anywhere.☆42Feb 12, 2026Updated 2 weeks ago
- G-Buffer-Conditioned Diffusion for Neural Forward Frame Rendering.☆23Jan 31, 2026Updated last month
- Kaleido: Open-sourced multi-subject reference video generation model, enabling controllable, high-fidelity video synthesis from multiple …☆115Dec 23, 2025Updated 2 months ago
- Reinforcing Text-Rich Video Reasoning with Visual Rumination☆27Nov 24, 2025Updated 3 months ago
- ☆37Oct 29, 2025Updated 4 months ago
- [CVPR 2026] Official Implementation of "Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models".☆15Updated this week
- [NeurIPS 2025] Encoder-Decoder Diffusion Language Models for Efficient Training and Inference☆36Oct 29, 2025Updated 4 months ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆50Nov 4, 2025Updated 3 months ago
- Official repo for paper "EMMA: Efficient Multimodal Understanding, Generation, and Editing with a Unified Architecture."☆61Dec 16, 2025Updated 2 months ago
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆121Mar 4, 2025Updated 11 months ago
- ☆34Jan 25, 2026Updated last month
- Official Pytorch implementation of "Omni-AVSR: Towards Unified Multimodal Speech Recognition with Large Language Models" [IEEE ICASSP 202…☆29Jan 18, 2026Updated last month
- [ICLR 2026] ContextGen: Contextual Layout Anchoring for Identity-Consistent Multi-Instance Generation☆67Feb 12, 2026Updated 2 weeks ago
- Official Repo for Tuning-Free Noise Rectification for High Fidelity Image-to-Video Generation☆30Mar 29, 2024Updated last year
- This repository provides an improved LLamaGen Model, fine-tuned on 500,000 high-quality images, each accompanied by over 300 token prompt…☆30Oct 21, 2024Updated last year
- UNCAGE: Contrastive Attention Guidance for Masked Generative Transformers in Text-to-Image Generation☆18Aug 12, 2025Updated 6 months ago
- LoPA: Scaling dLLM Inference via Lookahead Parallel Decoding☆34Jan 16, 2026Updated last month