moonmath-ai/LiteAttention

Transforming Video Diffusion with Temporal Sparse Attention

☆56

Alternatives and similar repositories for LiteAttention

Users interested in LiteAttention are comparing it to the libraries listed below.

ziplab / Pyramid-Sparse-Attention
Official PyTorch implementation of [PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation](https://arxiv.org/abs…
☆25 · Jan 25, 2026 · Updated 6 months ago
Bluear7878 / H2-Cache-A-Hierarchical-Dual-Stage-Cache
☆22 · Nov 3, 2025 · Updated 8 months ago
G-U-N / consolver
[CVPR 2026 (Highlight)] Unofficial Implementation of "Image Diffusion Preview with Consistency Solver"
☆31 · Jan 24, 2026 · Updated 6 months ago
ModelTC / quant_horizon
☆11 · Jan 10, 2025 · Updated last year
luongthecong123 / fp8-quant-matmul
Row-wise block scaling for fp8 quantization matrix multiplication. Solution to GPU mode AMD challenge.
☆19 · Feb 9, 2026 · Updated 5 months ago
ModelTC / QVGen
[ICLR 2026] This is the official PyTorch implementation of "QVGen: Pushing the Limit of Quantized Video Generative Models".
☆32 · Feb 11, 2026 · Updated 5 months ago
sandyresearch / chipmunk
🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× …
☆111 · Sep 8, 2025 · Updated 10 months ago
KlingAIResearch / VMoBA
Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"
☆64 · Jul 1, 2025 · Updated last year
ZhenglinZhou / Zero-1-to-A
[CVPR 2025] Zero-1-to-A: Zero-Shot One Image to Animatable Head Avatars Using Video Diffusion
☆43 · Mar 21, 2025 · Updated last year
svg-project / Sparse-VideoGen
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
☆697 · Jul 4, 2026 · Updated 3 weeks ago
gen-ai-team / Wan2.1-NABLA
Wan: Open and Advanced Large-Scale Video Generative Models
☆31 · Jul 28, 2025 · Updated last year
yifu-ding / BGEMM-CUDA
BGEMM-CUDA is a CUDA-based low-bit GEMM kernel library for efficient neural network inference. It implements optimized binary and ternary…
☆20 · Aug 30, 2024 · Updated last year
HiDream-ai / DreamJourney
[TMM 2025] Official Implementation of DreamJourney: Perpetual View Generation with Video Diffusion Models
☆18 · Jun 24, 2025 · Updated last year
facebookexperimental / CUTracer
A dynamic binary instrumentation tool for tracing and analyzing CUDA kernel instructions.
☆76 · Updated this week
Sugewud / Safe-Sora
[NeurIPS 2025] The official implementation of paper "Safe-Sora: Safe Text-to-Video Generation via Graphical Watermarking"
☆20 · Oct 10, 2025 · Updated 9 months ago
thu-ml / SpargeAttn
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
☆1,019 · Feb 25, 2026 · Updated 5 months ago
DualParal-Project / DualParal
[AAAI 2026] Minute-Long Videos with Dual Parallelisms
☆50 · Mar 25, 2026 · Updated 4 months ago
danielvegamyhre / ml-perf-reading-group
EleutherAI ML Performance reading group repository (slides, meeting recordings, annotated papers)
☆36 · Mar 20, 2026 · Updated 4 months ago
AniAggarwal / ecad
[ICLR 2026] Code for Evolutionary Caching to Accelerate Your Off-the-Shelf Diffusion Model
☆30 · Mar 1, 2026 · Updated 4 months ago
thu-ml / SLA
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention
☆324 · Feb 24, 2026 · Updated 5 months ago
kandinskylab / kvae
KVAE tokenizers
☆40 · Apr 21, 2026 · Updated 3 months ago
Xingyu-Zheng / BiDM
(NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models
☆22 · Jul 16, 2026 · Updated last week
xinghaow99 / pbs-attn
[ICML 2026] Sparser Block-Sparse Attention via Token Permutation
☆31 · May 22, 2026 · Updated 2 months ago
huggingface / flux-fast
Making Flux go brrr on GPUs.
☆170 · Jan 5, 2026 · Updated 6 months ago
dc-ai-projects / DC-VideoGen
DC-VideoGen: Efficient Video Generation with Deep Compression Video Autoencoder
☆192 · Oct 5, 2025 · Updated 9 months ago
tonysy / CapsuleNet-PyTorch
Implementation of CapsNet from the paper Dynamic Routing Between Capsules
☆10 · Nov 7, 2017 · Updated 8 years ago
BienLuky / Rectified-SpaAttn
The official implementation of "Rectified SpaAttn: Revisiting Attention Sparsity for Efficient Video Generation"
☆22 · Feb 8, 2026 · Updated 5 months ago
SonyResearch / IISA
[ICCV 2025] - Image Intrinsic Scale Assessment: Bridging the Gap Between Quality and Resolution
☆17 · Aug 16, 2025 · Updated 11 months ago
Shenyi-Z / Cache4Diffusion
Aiming to integrate most existing feature caching-based diffusion acceleration schemes into a unified framework.
☆110 · Oct 23, 2025 · Updated 9 months ago
AlonKellner / waloviz
An open source interactive spectrogram audio player, primarily based on bokeh and the holoviz stack (wav+holoviz=waloviz)
☆67 · Jan 19, 2026 · Updated 6 months ago
SandAI-org / MagiAttention
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
☆891 · Updated this week
UVA-Computer-Vision-Lab / FrameINO
[NeurIPS 2025] Frame In-N-Out: Unbounded Controllable Image-to-Video Generation
☆33 · May 1, 2026 · Updated 2 months ago
feice-huang / ConvRot
Official ConvRot implementation. A plug-and-play, convolution-like rotation module enabling efficient W4A4 quantization for diffusion mod…
☆20 · Jul 3, 2026 · Updated 3 weeks ago
thu-nics / MixDQ
[ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization
☆50 · Nov 27, 2024 · Updated last year
bytedance / ERTACache
☆25 · Sep 4, 2025 · Updated 10 months ago
leungll / NENU-Letter-Template
Made with LaTeX. NENU's recommendation letter template.
☆12 · May 26, 2024 · Updated 2 years ago
alimohammadiamirhossein / cora
✨ PyTorch implementation of "Cora: Correspondence-aware Image Editing Using Few-Step Diffusion", accepted at SIGGRAPH 2025.
☆35 · Jun 3, 2025 · Updated last year
YuyaoZhangQAQ / QCompiler
This repository contains the code for the paper “Neuro-Symbolic Query Compiler”, accepted to the Findings of ACL 2025.
☆17 · Oct 20, 2025 · Updated 9 months ago
suimuc / MTV_Framework
☆23 · Oct 15, 2025 · Updated 9 months ago