SandAI-org / MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
☆2,056 Updated this week
Alternatives and similar repositories for MAGI-1:
Users who are interested in MAGI-1 are comparing it to the repositories listed below.
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,349 Updated last week
- SkyReels-V2: Infinite-length Film Generative model ☆853 Updated this week
- SkyReels V1: The first and most advanced open-source human-centric video foundation model ☆2,095 Updated last month
- ☆2,859 Updated last month
- SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer ☆4,024 Updated this week
- Official PyTorch implementation of One-Minute Video Generation with Test-Time Training ☆1,417 Updated last week
- FastVideo is a lightweight framework for accelerating large video diffusion models. ☆1,352 Updated this week
- Official implementations for paper: VACE: All-in-One Video Creation and Editing ☆1,401 Updated this week
- ☆1,698 Updated last week
- 🔥🔥 UNO: A Universal Customization Method for Both Single and Multi-Subject Conditioning ☆876 Updated last week
- Official repository for LTX-Video ☆3,506 Updated last week
- Video Generation Foundation Models: https://saiyan-world.github.io/goku/ ☆2,809 Updated 2 months ago
- ☆732 Updated 2 months ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ☆2,900 Updated 4 months ago
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆2,131 Updated last month
- The best OSS video generation models ☆3,102 Updated 3 months ago
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models ☆1,198 Updated 3 weeks ago
- CogView4, CogView3-Plus and CogView3 (ECCV 2024) ☆1,003 Updated 3 weeks ago
- Wan: Open and Advanced Large-Scale Video Generative Models ☆10,569 Updated this week
- Memory-optimized training library for diffusion models ☆1,068 Updated this week
- A minimal and universal controller for FLUX.1. ☆1,485 Updated last week
- The official repo of MiniMax-Text-01 and MiniMax-VL-01, large-language-model & vision-language-model based on Linear Attention ☆2,540 Updated 2 weeks ago
- [ARXIV'25] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video ☆1,082 Updated this week
- Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and pe… ☆2,634 Updated last week
- A pipeline parallel training script for diffusion models. ☆937 Updated this week
- Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment ☆679 Updated this week
- [CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis ☆1,200 Updated 2 months ago
- SkyReels-A1: Expressive Portrait Animation in Video Diffusion Transformers ☆491 Updated this week
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis ☆1,425 Updated this week
- Official repository of In-Context LoRA for Diffusion Transformers ☆1,823 Updated 4 months ago