Alpha-VLLM / Lumina-mGPT-2.0

Lumina-mGPT 2.0: Stand-Alone AutoRegressive Image Modeling

☆1,078

Alternatives and similar repositories for Lumina-mGPT-2.0

Users who are interested in Lumina-mGPT-2.0 often compare it to the repositories listed below.

Alpha-VLLM / Lumina-Video
☆414 · Mar 10, 2025 · Updated 11 months ago
Alpha-VLLM / Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…
☆638 · Oct 16, 2025 · Updated 4 months ago
HiDream-ai / HiDream-I1
☆2,500 · Jul 16, 2025 · Updated 7 months ago
wdrink / SimpleAR
PyTorch implementation for the paper titled "SimpleAR: Pushing the Frontier of Autoregressive Visual Generation"
☆427 · Jun 20, 2025 · Updated 7 months ago
FoundationVision / Liquid
(Accepted by IJCV) Liquid: Language Models are Scalable and Unified Multi-modal Generators
☆637 · Nov 10, 2025 · Updated 3 months ago
Alpha-VLLM / Lumina-Accessory
☆114 · Apr 25, 2025 · Updated 9 months ago
ali-vilab / VACE
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
☆3,638 · Oct 17, 2025 · Updated 4 months ago
Alpha-VLLM / Lumina-Image-2.0
Lumina-Image 2.0: A Unified and Efficient Image Generative Framework
☆860 · Nov 3, 2025 · Updated 3 months ago
bytedance / UNO
[ICCV 2025] 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning
☆1,350 · Sep 12, 2025 · Updated 5 months ago
SkyworkAI / SkyReels-A2
SkyReels-A2: Compose anything in video diffusion transformers
☆701 · Jun 3, 2025 · Updated 8 months ago
FoundationVision / Infinity
[CVPR 2025 Oral] Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
☆1,544 · Nov 10, 2025 · Updated 3 months ago
JiuhaiChen / BLIP3o
Official implementation of BLIP3o-Series
☆1,638 · Nov 29, 2025 · Updated 2 months ago
SilentView / GigaTok
[ICCV 2025] Official repo for "GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation"
☆198 · Jan 7, 2026 · Updated last month
KlingAIResearch / ReCamMaster
[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
☆1,739 · Nov 28, 2025 · Updated 2 months ago
baaivision / NOVA
[ICLR 2025] Autoregressive Video Generation without Vector Quantization
☆625 · Oct 29, 2025 · Updated 3 months ago
TencentARC / AnimeGamer
[ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction
☆346 · Apr 9, 2025 · Updated 10 months ago
ByteDance-Seed / Bagel
Open-source unified multimodal model
☆5,654 · Oct 27, 2025 · Updated 3 months ago
NVlabs / Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
☆4,963 · Updated this week
bytedance / InfiniteYou
🔥 [ICCV 2025 Highlight] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
☆2,663 · Aug 22, 2025 · Updated 5 months ago
Tencent-Hunyuan / InstantCharacter
☆1,048 · May 14, 2025 · Updated 9 months ago
VARGPT-family / VARGPT-v1.1
VARGPT-v1.1: Improve Visual Autoregressive Large Unified Model via Iterative Instruction Tuning and Reinforcement Learning
☆270 · Apr 15, 2025 · Updated 10 months ago
lzyhha / VisualCloze
[ICCV 2025] VisualCloze: A universal image generation framework that can support a wide range of in-domain tasks and generalize to unseen…
☆277 · Jan 7, 2026 · Updated last month
SandAI-org / MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
☆3,641 · Jun 17, 2025 · Updated 8 months ago
HiDream-ai / HiDream-E1
☆787 · Jul 17, 2025 · Updated 7 months ago
thu-ml / DiT-Extrapolation
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) and UltraViCo (IC…
☆784 · Feb 2, 2026 · Updated 2 weeks ago
mit-han-lab / hart
HART: Efficient Visual Generation with Hybrid Autoregressive Transformer
☆648 · Oct 16, 2024 · Updated last year
stepfun-ai / Step1X-Edit
A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem…
☆2,137 · Dec 29, 2025 · Updated last month
ZiyuGuo99 / Image-Generation-CoT
[CVPR 2025] The First Investigation of CoT Reasoning (RL, TTS, Reflection) in Image Generation
☆855 · May 23, 2025 · Updated 8 months ago
PKU-Alignment / align-anything
Align Anything: Training All-modality Model with Feedback
☆4,632 · Nov 27, 2025 · Updated 2 months ago
FoundationVision / FlashVideo
[AAAI-2026] FlashVideo: Flowing Fidelity to Detail for Efficient High-Resolution Video Generation
☆457 · Mar 5, 2025 · Updated 11 months ago
FoundationVision / UniTok
[NeurIPS 2025 Spotlight] A Unified Tokenizer for Visual Generation and Understanding
☆508 · Nov 14, 2025 · Updated 3 months ago
Diffusion-CoT / ReflectionFlow
[ICCV 2025] Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning
☆216 · Nov 5, 2025 · Updated 3 months ago
FoundationVision / VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…
☆8,614 · Nov 10, 2025 · Updated 3 months ago
bytedance / DreamO
[SIGGRAPH Asia 2025] DreamO: A Unified Framework for Image Customization
☆1,745 · Aug 14, 2025 · Updated 6 months ago
FoundationVision / LlamaGen
Autoregressive Model Beats Diffusion: 🦙 Llama for Scalable Image Generation
☆1,932 · Aug 15, 2024 · Updated last year
SkyworkAI / Skywork-R1V
Skywork-R1V is an advanced multimodal AI model series developed by Skywork AI, specializing in vision-language reasoning.
☆3,149 · Dec 15, 2025 · Updated 2 months ago
primecai / diffusion-self-distillation
[CVPR 2025] Diffusion Self-Distillation for Zero-Shot Customized Image Generation
☆461 · Mar 18, 2025 · Updated 10 months ago
NexaAI / nexa-sdk
Run frontier LLMs and VLMs with day-0 model support across GPU, NPU, and CPU, with comprehensive runtime coverage for PC (Python/C++), mo…
☆7,702 · Updated this week
TrajectoryCrafter / TrajectoryCrafter
[ICCV 2025, Oral] TrajectoryCrafter: Redirecting Camera Trajectory for Monocular Videos via Diffusion Models
☆835 · Dec 17, 2025 · Updated 2 months ago
