xdit-project / xDiTLinks

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

☆2,172

Alternatives and similar repositories for xDiT

Users that are interested in xDiT are comparing it to the libraries listed below

Sorting:

thu-ml / SageAttention
Quantized Attention achieves speedup of 2-5x and 3-11x compared to FlashAttention and xformers, without lossing end-to-end metrics across…
☆2,122Updated 2 weeks ago
thu-ml / SpargeAttn
SpargeAttention: A training-free sparse attention that can accelerate any model inference.
☆670Updated last week
hao-ai-lab / FastVideo
An unified inference and post-training framework for accelerated video generation.
☆1,690Updated this week
SandAI-org / MagiAttention
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
☆456Updated this week
mit-han-lab / distrifuser
[CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models
☆699Updated 8 months ago
xlite-dev / Awesome-DiT-Inference
📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉
☆350Updated last week
nunchaku-tech / deepcompressor
Model Compression Toolbox for Large Language Models and Diffusion Models
☆578Updated 4 months ago
NUS-HPC-AI-Lab / VideoSys
VideoSys: An easy and efficient system for video generation
☆1,992Updated 4 months ago
ali-vilab / TeaCache
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
☆1,023Updated 2 months ago
FoundationVision / Infinity
[CVPR 2025 Oral]Infinity ∞ : Scaling Bitwise AutoRegressive Modeling for High-Resolution Image Synthesis
☆1,393Updated last month
chengzeyi / ParaAttention
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
☆350Updated last month
huggingface / finetrainers
Scalable and memory-optimized training of diffusion models
☆1,243Updated 2 months ago
nunchaku-tech / nunchaku
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
☆2,545Updated this week
zhihu / ZhiLight
A highly optimized LLM inference acceleration engine for Llama and its variants.
☆902Updated 3 weeks ago
horseee / DeepCache
[CVPR 2024] DeepCache: Accelerating Diffusion Models for Free
☆914Updated last year
chengzeyi / stable-fast
https://wavespeed.ai/ Best inference performance optimization framework for HuggingFace Diffusers on NVIDIA GPUs.
☆1,281Updated 4 months ago
rhymes-ai / Allegro
Allegro is a powerful text-to-video model that generates high-quality videos up to 6 seconds at 15 FPS and 720p resolution from simple te…
☆1,089Updated 6 months ago
bytedance / flux
A fast communication-overlapping library for tensor/expert parallelism on GPUs.
☆1,046Updated last week
Vchitect / VBench
[CVPR2024 Highlight] VBench - We Evaluate Video Generation
☆1,135Updated last week
feifeibear / long-context-attention
USP: Unified (a.k.a. Hybrid, 2D) Sequence Parallel Attention for Long Context Transformers Model Training and Inference
☆541Updated 3 weeks ago
IDKiro / sdxs
Official repo of our paper "SDXS: Real-Time One-Step Latent Diffusion Models with Image Conditions"
☆628Updated last year
svg-project / Sparse-VideoGen
[ICML2025] Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity
☆393Updated 2 months ago
alibaba / Tora
[CVPR'25]Tora: Trajectory-oriented Diffusion Transformer for Video Generation
☆1,193Updated 3 weeks ago
Vchitect / Vchitect-2.0
Vchitect-2.0: Parallel Transformer for Scaling Up Video Diffusion Models
☆915Updated 4 months ago
tianweiy / CausVid
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
☆836Updated 2 months ago
siliconflow / onediff
OneDiff: An out-of-the-box acceleration library for diffusion models.
☆1,919Updated 3 months ago
NJU-PCALab / RAG-Diffusion
[ICCV 2025] Region-Aware Text-to-Image Generation via Hard Binding and Soft Refinement 🔥
☆593Updated last month
mit-han-lab / radial-attention
Radial Attention Official Implementation
☆470Updated this week
thu-ml / RIFLEx
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025)
☆704Updated 2 months ago
Shenyi-Z / TaylorSeer
[ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
☆245Updated last month