Alpha-VLLM / Lumina-mGPT-2.0View external linksLinks
Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling
☆1,078Nov 3, 2025Updated 3 months ago
Alternatives and similar repositories for Lumina-mGPT-2.0
Users that are interested in Lumina-mGPT-2.0 are comparing it to the libraries listed below
Sorting:
- ☆414Mar 10, 2025Updated 11 months ago
- Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…☆638Oct 16, 2025Updated 4 months ago
- ☆2,500Jul 16, 2025Updated 7 months ago
- Pytorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"☆427Jun 20, 2025Updated 7 months ago
- (Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators☆637Nov 10, 2025Updated 3 months ago
- ☆114Apr 25, 2025Updated 9 months ago
- [ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing☆3,638Oct 17, 2025Updated 4 months ago
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework☆860Nov 3, 2025Updated 3 months ago
- [ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning☆1,350Sep 12, 2025Updated 5 months ago
- SkyReels-A2: Compose anything in video diffusion transformers☆701Jun 3, 2025Updated 8 months ago
- [CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis☆1,544Nov 10, 2025Updated 3 months ago
- Official implementation of BLIP3o-Series☆1,638Nov 29, 2025Updated 2 months ago
- [ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"☆198Jan 7, 2026Updated last month
- [ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video☆1,739Nov 28, 2025Updated 2 months ago
- [ICLR 2025] Autoregressive Video Generation without Vector Quantization☆625Oct 29, 2025Updated 3 months ago
- [ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction☆346Apr 9, 2025Updated 10 months ago
- Open-source unified multimodal model☆5,654Oct 27, 2025Updated 3 months ago
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer☆4,963Updated this week
- 🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity☆2,663Aug 22, 2025Updated 5 months ago
- ☆1,048May 14, 2025Updated 9 months ago
- VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning☆270Apr 15, 2025Updated 10 months ago
- [ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…☆277Jan 7, 2026Updated last month
- MAGI-1: Autoregressive Video Generation at Scale☆3,641Jun 17, 2025Updated 8 months ago
- ☆787Jul 17, 2025Updated 7 months ago
- Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) and UltraViCo (IC…☆784Feb 2, 2026Updated 2 weeks ago
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer☆648Oct 16, 2024Updated last year
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…☆2,137Dec 29, 2025Updated last month
- [CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation☆855May 23, 2025Updated 8 months ago
- Align Anything: Training All-modality Model with Feedback☆4,632Nov 27, 2025Updated 2 months ago
- [AAAI-2026]FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation☆457Mar 5, 2025Updated 11 months ago
- [NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding☆508Nov 14, 2025Updated 3 months ago
- [ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning☆216Nov 5, 2025Updated 3 months ago
- [NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…☆8,614Nov 10, 2025Updated 3 months ago
- [SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization☆1,745Aug 14, 2025Updated 6 months ago
- Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation☆1,932Aug 15, 2024Updated last year
- Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.☆3,149Dec 15, 2025Updated 2 months ago
- [CVPR 2025] Diffusion Self-Distillation for Zero-Shot Customized Image Generation☆461Mar 18, 2025Updated 10 months ago
- Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mo…☆7,702Updated this week
- [ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models☆835Dec 17, 2025Updated 2 months ago