chengzeyi / ParaAttentionLinks
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
☆373Updated 2 months ago
Alternatives and similar repositories for ParaAttention
Users that are interested in ParaAttention are comparing it to the libraries listed below
Sorting:
- Radial Attention Official Implementation☆500Updated last week
- Light Video Generation Inference Framework☆558Updated this week
- Model Compression Toolbox for Large Language Models and Diffusion Models☆628Updated last month
- End-to-end recipes for optimizing diffusion models with torchao and diffusers (inference and FP8 training).☆377Updated 3 months ago
- [ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers☆278Updated last month
- [ICLR 2025] FasterCache: Training-Free Video Diffusion Model Acceleration with High Quality☆245Updated 8 months ago
- Combining Teacache with xDiT to Accelerate Visual Generation Models☆31Updated 4 months ago
- [ICML2025] Sparse VideoGen: Accelerating Video Diffusion Transformers with Spatial-Temporal Sparsity☆424Updated 2 weeks ago
- [NeurIPS 2024] AsyncDiff: Parallelizing Diffusion Models by Asynchronous Denoising☆205Updated 6 months ago
- Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model☆1,097Updated 3 months ago
- A Unified Cache Acceleration Toolbox for 🤗Diffusers: FLUX.1, Qwen-Image-Edit, Qwen-Image, HunyuanImage-2.1, Wan 2.1/2.2, etc.☆275Updated this week
- SpargeAttention: A training-free sparse attention that can accelerate any model inference.☆709Updated last month
- The official code for "MagCache: Fast Video Generation with Magnitude-Aware Cache"☆199Updated 3 weeks ago
- Flux diffusion model implementation using quantized fp8 matmul & remaining layers use faster half precision accumulate, which is ~2x fast…☆276Updated 11 months ago
- An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional var…☆139Updated 2 months ago
- [CVPR 2024 Highlight] DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models☆704Updated 9 months ago
- ☆231Updated this week
- 📚A curated list of Awesome Diffusion Inference Papers with Codes: Sampling, Cache, Quantization, Parallelism, etc.🎉☆394Updated 3 weeks ago
- Making Flux go brrr on GPUs.☆137Updated last month
- ☆173Updated 8 months ago
- faster parallel inference of mochi-1 video generation model☆126Updated 6 months ago
- [ICLR2025] Accelerating Diffusion Transformers with Token-wise Feature Caching☆176Updated 6 months ago
- [NeurIPS 2024] Boosting the performance of consistency models with PCM!☆494Updated 9 months ago
- ☆284Updated 8 months ago
- An out-of-the-box inference acceleration engine for Diffusion and DiT models☆53Updated 5 months ago
- A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training☆502Updated last week
- Enhance-A-Video: Better Generated Video for Free☆571Updated 5 months ago
- A parallelism VAE avoids OOM for high resolution image generation☆77Updated last month
- T-GATE: Temporally Gating Attention to Accelerate Diffusion Model for Free!☆406Updated 6 months ago
- Faster generation with text-to-image diffusion models.☆226Updated 2 months ago