NVlabs/Sana

SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer

☆8,505

Alternatives and similar repositories for Sana

Users that are interested in Sana are comparing it to the libraries listed below.

hao-ai-lab / FastVideo
A unified inference and post-training framework for accelerated video generation.
☆3,865 · Updated this week
NVlabs / LongLive
Long Video Gen Infrastructure
☆2,483 · Updated this week
ByteDance-Seed / Bagel
Open-source unified multimodal model
☆6,106 · May 4, 2026 · Updated 2 months ago
tianweiy / DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,404 · Mar 5, 2025 · Updated last year
guandeh17 / Self-Forcing
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
☆3,453 · Sep 12, 2025 · Updated 10 months ago
sihyun-yu / REPA
[ICLR'25 Oral] Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think
☆1,679 · Mar 16, 2025 · Updated last year
Lightricks / LTX-Video
Official repository for LTX-Video
☆10,716 · Jan 5, 2026 · Updated 6 months ago
yifan123 / flow_grpo
[NeurIPS 2025] An official implementation of Flow-GRPO: Training Flow Matching Models via Online RL
☆2,423 · May 7, 2026 · Updated 2 months ago
tianweiy / CausVid
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
☆1,399 · Aug 7, 2025 · Updated 11 months ago
ali-vilab / VACE
[ICCV 2025] Official implementations for paper: VACE: All-in-One Video Creation and Editing
☆3,874 · Oct 17, 2025 · Updated 9 months ago
NVlabs / rcm
rCM & Causal-rCM: Leading and Unified Algorithms/Infrastructures for Bidirectional/Autoregressive Video Diffusion Distillation at Scale
☆768 · Jun 25, 2026 · Updated 3 weeks ago
PKU-YuanGroup / Helios
Helios: Real Real-Time Long Video Generation Model
☆1,995 · Jun 10, 2026 · Updated last month
hustvl / LightningDiT
[CVPR 2025 Oral] Reconstruction vs. Generation: Taming Optimization Dilemma in Latent Diffusion Models
☆1,508 · Dec 16, 2025 · Updated 7 months ago
NVIDIA / cosmos
NVIDIA Cosmos is an open platform of world models, datasets, and tools that enables developers to build Physical AI for robots, autonomou…
☆11,162 · Updated this week
PixArt-alpha / PixArt-alpha
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis
☆3,299 · Oct 31, 2024 · Updated last year
black-forest-labs / flux
Official inference repo for FLUX.1 models
☆25,759 · Jul 31, 2025 · Updated 11 months ago
thu-ml / TurboDiffusion
TurboDiffusion: 100–200× Acceleration for Video Diffusion Models
☆3,577 · Updated this week
thu-ml / Causal-Forcing
[ICML 2026] Official codebase for "Causal Forcing: Autoregressive Diffusion Distillation Done Right for High-Quality Real-Time Interactiv…
☆874 · Updated this week
Yuanshi9815 / OminiControl
[ICCV 2025 Highlight] OminiControl: Minimal and Universal Control for Diffusion Transformer
☆1,926 · Jul 2, 2026 · Updated 2 weeks ago
zai-org / CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
☆12,896 · Nov 4, 2025 · Updated 8 months ago
Wan-Video / Wan2.2
Wan: Open and Advanced Large-Scale Video Generative Models
☆16,780 · Mar 17, 2026 · Updated 4 months ago
Tencent-Hunyuan / HY-WorldPlay
HY-World 1.5: A Systematic Framework for Interactive World Modeling with Real-Time Latency and Geometric Consistency
☆1,558 · Jun 10, 2026 · Updated last month
Wan-Video / Wan2.1
Wan: Open and Advanced Large-Scale Video Generative Models
☆16,615 · Mar 5, 2026 · Updated 4 months ago
shengshu-ai / minWM
A Minimal and Elegant Framework & Tutorial for Real-Time Interactive World Models
☆724 · Jun 15, 2026 · Updated last month
bytetriper / RAE
Official PyTorch Implementation of "Diffusion Transformers with Representation Autoencoders"
☆1,977 · Feb 25, 2026 · Updated 4 months ago
KlingAIResearch / ReCamMaster
[ICCV'25 Best Paper Finalist] ReCamMaster: Camera-Controlled Generative Rendering from A Single Video
☆1,830 · Nov 28, 2025 · Updated 7 months ago
QwenLM / Qwen-Image
Qwen-Image is a powerful image generation foundation model capable of complex text rendering and precise image editing.
☆8,142 · Feb 10, 2026 · Updated 5 months ago
Robbyant / lingbot-world
Advancing Open-source World Models
☆4,243 · Jul 9, 2026 · Updated last week
aigc-apps / VideoX-Fun
📹 A more flexible framework that can generate videos at any resolution and create videos from images.
☆2,178 · Updated this week
nv-tlabs / lyra
Project Lyra: Open Generative 3D World Models
☆2,165 · Updated this week
lllyasviel / IC-Light
More relighting!
☆8,473 · Feb 20, 2025 · Updated last year
NVlabs / DiffusionNFT
[ICLR 2026 Oral] DiffusionNFT: Online Diffusion Reinforcement with Forward Process
☆975 · Feb 10, 2026 · Updated 5 months ago
Tencent-Hunyuan / HunyuanVideo
HunyuanVideo: A Systematic Framework For Large Video Generation Model
☆12,350 · Jun 29, 2026 · Updated 3 weeks ago
baaivision / Emu3.5
Native Multimodal Models are World Learners
☆1,536 · Dec 30, 2025 · Updated 6 months ago
nunchaku-ai / nunchaku
[ICLR 2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
☆3,911 · Mar 7, 2026 · Updated 4 months ago
JiuhaiChen / BLIP3o
Official implementation of BLIP3o-Series
☆1,663 · Nov 29, 2025 · Updated 7 months ago
FoundationVision / VAR
[NeurIPS 2024 Best Paper Award][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Mod…
☆8,708 · Nov 10, 2025 · Updated 8 months ago
FoundationVision / Infinity
[CVPR 2025 Oral] Infinity ∞: Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
☆1,579 · Apr 16, 2026 · Updated 3 months ago
lllyasviel / FramePack
Let's make video diffusion practical!
☆17,124 · Oct 16, 2025 · Updated 9 months ago