NVlabs / Sana
SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer
☆3,386Updated last week
Alternatives and similar repositories for Sana:
Users that are interested in Sana are comparing it to the libraries listed below
- Official repository for LTX-Video☆2,857Updated this week
- Code of Pyramidal Flow Matching for Efficient Video Generative Modeling☆2,780Updated last month
- The best OSS video generation models☆2,915Updated last month
- ☆1,880Updated 3 months ago
- Lumina-T2X is a unified framework for Text to Any Modality Generation☆2,156Updated this week
- A general fine-tuning kit geared toward diffusion models.☆2,092Updated last week
- Various AI scripts. Mostly Stable Diffusion stuff.☆4,024Updated this week
- ☆983Updated last month
- OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340☆3,596Updated last week
- 📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion☆1,987Updated this week
- [NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment☆3,103Updated 2 months ago
- A minimal and universal controller for FLUX.1.☆1,202Updated 3 weeks ago
- PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation☆1,761Updated 3 months ago
- FastVideo is a lightweight framework for accelerating large video diffusion models.☆1,095Updated this week
- STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution☆939Updated 3 weeks ago
- [arXiv 2024] Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,096Updated this week
- Controllable video and image Generation, SVD, Animate Anyone, ControlNet, ControlNeXt, LoRA☆1,520Updated 4 months ago
- PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis☆2,971Updated 3 months ago
- Your image is almost there!☆7,498Updated 6 months ago
- HunyuanVideo: A Systematic Framework For Large Video Generation Model☆8,621Updated this week
- Official repository of In-Context LoRA for Diffusion Transformers☆1,595Updated 2 months ago
- [ECCV 2024, Oral] DynamiCrafter: Animating Open-domain Images with Video Diffusion Priors☆2,773Updated 5 months ago
- Taming Stable Diffusion for Lip Sync!☆2,538Updated last month
- ☆778Updated 3 weeks ago
- ☆2,176Updated last week
- Dead simple FLUX LoRA training UI with LOW VRAM support☆2,004Updated last month