mit-han-lab/radial-attention

[NeurIPS 2025] Radial Attention: O(nlogn) Sparse Attention with Energy Decay for Long Video Generation

☆604

Alternatives and similar repositories for radial-attention

Users that are interested in radial-attention are comparing it to the libraries listed below.

svg-project / Sparse-VideoGen
[ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention
☆697 • Jul 4, 2026 • Updated 3 weeks ago
thu-ml / SpargeAttn
[ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference.
☆1,019 • Feb 25, 2026 • Updated 5 months ago
tianweiy / CausVid
(CVPR 2025) From Slow Bidirectional to Fast Autoregressive Video Diffusion Models
☆1,409 • Aug 7, 2025 • Updated 11 months ago
thu-ml / SageAttention
[ICLR2025, ICML2025, NeurIPS2025 Spotlight] Quantized Attention achieves speedup of 2-5x compared to FlashAttention, without losing end-t…
☆3,518 • Jan 17, 2026 • Updated 6 months ago
hao-ai-lab / FastVideo
A unified inference and post-training framework for accelerated video generation.
☆3,888 • Updated this week
thu-ml / SLA
SLA: Beyond Sparsity in Diffusion Transformers via Fine-Tunable Sparse–Linear Attention
☆324 • Feb 24, 2026 • Updated 5 months ago
guandeh17 / Self-Forcing
Official codebase for "Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion" (NeurIPS 2025 Spotlight)
☆3,465 • Sep 12, 2025 • Updated 10 months ago
ali-vilab / TeaCache
Timestep Embedding Tells: It's Time to Cache for Video Diffusion Model
☆1,358 • Jun 8, 2025 • Updated last year
KlingAIResearch / VMoBA
Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"
☆64 • Jul 1, 2025 • Updated last year
JIA-Lab-research / Jenga
[NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving
☆287 • Aug 4, 2025 • Updated 11 months ago
nunchaku-ai / nunchaku
[ICLR2025 Spotlight] SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
☆3,920 • Mar 7, 2026 • Updated 4 months ago
chengzeyi / ParaAttention
https://wavespeed.ai/ Context parallel attention that accelerates DiT model inference with dynamic caching
☆427 • Jul 5, 2025 • Updated last year
mit-han-lab / Block-Sparse-Attention
A sparse attention kernel supporting mix sparse patterns
☆539 • Jan 18, 2026 • Updated 6 months ago
Zehong-Ma / MagCache
The official code for NeurIPS 2025 "MagCache: Fast Video Generation with Magnitude-Aware Cache"
☆275 • Nov 17, 2025 • Updated 8 months ago
Yaofang-Liu / Pusa-VidGen
Pusa: Thousands Timesteps Video Diffusion Model
☆686 • Feb 13, 2026 • Updated 5 months ago
NVlabs / rcm
rCM & Causal-rCM: Leading and Unified Algorithms/Infrastructures for Bidirectional/Autoregressive Video Diffusion Distillation at Scale
☆774 • Jun 25, 2026 • Updated last month
Peyton-Chen / Sparse-vDiT
The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …
☆52 • Jun 6, 2025 • Updated last year
ModelTC / LightX2V
Lightweight Image Video Action Generation Inference Framework
☆2,540 • Updated this week
nunchaku-ai / deepcompressor
Model Compression Toolbox for Large Language Models and Diffusion Models
☆796 • Aug 14, 2025 • Updated 11 months ago
GoatWu / Self-Forcing-Plus
Unofficial extension implementation of Self-Forcing to support I2V && 14B training.
☆380 • Sep 29, 2025 • Updated 10 months ago
mit-han-lab / x-attention
[ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring
☆280 • Jul 6, 2025 • Updated last year
madebyollin / taehv
Tiny AutoEncoder for Hunyuan Video (and other video models)
☆450 • Jul 14, 2026 • Updated 2 weeks ago
svg-project / Quant-VideoGen
[ICML2026] Auto-Regressive Long Video Generation via 2-Bit KV-Cache Quantization
☆61 • Updated this week
SandAI-org / MagiAttention
A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training
☆891 • Updated this week
tianweiy / DMD2
(NeurIPS 2024 Oral 🔥) Improved Distribution Matching Distillation for Fast Image Synthesis
☆1,415 • Mar 5, 2025 • Updated last year
Zehong-Ma / ComfyUI-MagCache
The official code that integrates MagCache (Fast Video Generation with Magnitude-Aware Cache) with ComfyUI.
☆275 • Nov 27, 2025 • Updated 8 months ago
justincui03 / Self-Forcing-Plus-Plus
Official Repo for Self-Forcing++ High Quality Long Video Generation
☆267 • Oct 13, 2025 • Updated 9 months ago
jt-zhang / Sparse_Attention_API
☆66 • Oct 25, 2025 • Updated 9 months ago
mit-han-lab / lpd
[ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆104 • May 8, 2026 • Updated 2 months ago
thu-ml / DiT-Extrapolation
Official implementation for "RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers" (ICML 2025) , UltraViCo (ICLR…
☆820 • Jun 6, 2026 • Updated last month
ZulutionAI / MoviiGen1.1
MoviiGen 1.1: Towards Cinematic-Quality Video Generative Models
☆183 • Jul 21, 2025 • Updated last year
vipshop / cache-dit
A PyTorch-native inference engine with cache, parallelism, quantization and cpu offload for DiTs.
☆1,239 • Updated this week
JaydenLyh / Reward-Forcing
[CVPR 2026 Highlight] Reward Forcing: Efficient Streaming Video Generation with Rewarded Distribution Matching Distillation
☆352 • Dec 15, 2025 • Updated 7 months ago
FoundationVision / InfinityStar
[NeurIPS 2025 Oral] Infinity⭐️: Unified Spacetime AutoRegressive Modeling for Visual Generation
☆774 • Apr 16, 2026 • Updated 3 months ago
shawnricecake / draft-attention
Code for Draft Attention
☆103 • May 22, 2025 • Updated last year
SandAI-org / MAGI-1
MAGI-1: Autoregressive Video Generation at Scale
☆3,748 • Jun 17, 2026 • Updated last month
Lakonik / LakonLab
Official implementation of AsymFlow, pi-Flow, GMFlow
☆455 • Jul 14, 2026 • Updated 2 weeks ago
Shenyi-Z / TaylorSeer
[ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers
☆408 • Mar 2, 2026 • Updated 4 months ago
MAGREF-Video / MAGREF
Official implementation of MAGREF: Masked Guidance for Any-Reference Video Generation with Subject Disentanglement (ICLR2026)
☆298 • Mar 24, 2026 • Updated 4 months ago