NVlabs / Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
☆4,082 Updated 2 weeks ago
Alternatives and similar repositories for Sana:
Users who are interested in Sana are comparing it to the libraries listed below:
- Official repository for LTX-Video ☆3,584 Updated 2 weeks ago
- MAGI-1: Autoregressive Video Generation at Scale ☆2,857 Updated last week
- FastVideo is a lightweight framework for accelerating large video diffusion models. ☆1,366 Updated this week
- The best OSS video generation models ☆3,138 Updated 3 months ago
- HunyuanVideo-I2V: A Customizable Image-to-Video Model based on HunyuanVideo ☆1,380 Updated 3 weeks ago
- [ICLR 2025] Pyramidal Flow Matching for Efficient Video Generative Modeling ☆2,914 Updated 4 months ago
- A general fine-tuning kit geared toward diffusion models. ☆2,285 Updated last week
- A minimal and universal controller for FLUX.1. ☆1,534 Updated last week
- Official implementations for paper: VACE: All-in-One Video Creation and Editing ☆1,461 Updated 2 weeks ago
- Official repository of In-Context LoRA for Diffusion Transformers ☆1,838 Updated 4 months ago
- ☆1,153 Updated 4 months ago
- A SOTA open-source image editing model, which aims to provide comparable performance against the closed-source models like GPT-4o and Gem… ☆1,111 Updated this week
- ☆1,884 Updated last week
- ☆2,030 Updated 6 months ago
- ☆2,907 Updated last month
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion ☆2,135 Updated 2 months ago
- SkyReels V1: The first and most advanced open-source human-centric video foundation model ☆2,131 Updated last month
- Implementation of "EasyControl: Adding Efficient and Flexible Control for Diffusion Transformer" ☆1,426 Updated 3 weeks ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation ☆2,183 Updated 2 months ago
- [ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models ☆1,626 Updated this week
- Stable Virtual Camera: Generative View Synthesis with Diffusion Models ☆1,232 Updated last week
- ☆812 Updated 2 weeks ago
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis ☆1,466 Updated last week
- SkyReels-V2: Infinite-length Film Generative model ☆1,841 Updated last week
- 🔥 InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity ☆2,171 Updated 3 weeks ago
- Memory-optimized training library for diffusion models ☆1,110 Updated this week
- Lumina-Image 2.0: A Unified and Efficient Image Generative Framework ☆683 Updated 2 weeks ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation ☆1,803 Updated 6 months ago
- Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion ☆1,530 Updated 2 weeks ago
- CogView4, CogView3-Plus and CogView3 (ECCV 2024) ☆1,022 Updated last month