Code for Draft Attention
☆99 · May 22, 2025 · Updated 9 months ago
Alternatives and similar repositories for draft-attention
Users interested in draft-attention are comparing it to the repositories listed below.
- This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diff… ☆55 · May 21, 2025 · Updated 9 months ago
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention ☆629 · Feb 3, 2026 · Updated last month
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv … ☆51 · Jun 6, 2025 · Updated 9 months ago
- ☆20 · Dec 24, 2024 · Updated last year
- Benchmark tests supporting the TiledCUDA library. ☆18 · Nov 19, 2024 · Updated last year
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring ☆269 · Jul 6, 2025 · Updated 8 months ago
- PyTorch implementation of the Flash Spectral Transform Unit. ☆21 · Sep 19, 2024 · Updated last year
- lite attention implemented over flash attention 3 ☆45 · Updated this week
- ☆33 · Jul 9, 2025 · Updated 7 months ago
- ☆32 · Jul 2, 2025 · Updated 8 months ago
- [ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers ☆373 · Feb 16, 2026 · Updated 2 weeks ago
- Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026) ☆53 · Feb 23, 2026 · Updated last week
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving ☆275 · Aug 4, 2025 · Updated 7 months ago
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Models (ICLR 2026) ☆42 · Updated this week
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆48 · Jul 17, 2025 · Updated 7 months ago
- [ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference. ☆952 · Feb 25, 2026 · Updated last week
- [ICCV 2025] Official repository of the paper "Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos" ☆39 · Feb 2, 2026 · Updated last month
- Beyond KV Caching: Shared Attention for Efficient LLMs ☆20 · Jul 19, 2024 · Updated last year
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference ☆57 · Nov 20, 2024 · Updated last year
- ☆14 · May 14, 2019 · Updated 6 years ago
- ☆136 · May 29, 2025 · Updated 9 months ago
- [ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time ☆329 · Oct 31, 2025 · Updated 4 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models ☆39 · Jan 5, 2026 · Updated 2 months ago
- A sparse attention kernel supporting mix sparse patterns ☆472 · Jan 18, 2026 · Updated last month
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference. ☆91 · Jul 17, 2025 · Updated 7 months ago
- ☆191 · Jan 14, 2025 · Updated last year
- The official code of "Beyond VLM-Based Rewards: Diffusion-Native Latent Reward Modeling" ☆47 · Feb 26, 2026 · Updated last week
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span … ☆14 · Aug 25, 2023 · Updated 2 years ago
- An efficient distillation method for flow matching models ☆22 · Feb 1, 2026 · Updated last month
- Learning to Skip the Middle Layers of Transformers ☆17 · Aug 7, 2025 · Updated 6 months ago
- diffusers with search engine ☆12 · Jan 13, 2026 · Updated last month
- ☆12 · Jan 29, 2021 · Updated 5 years ago
- ☆13 · Jan 7, 2025 · Updated last year
- This is the official implementation of our paper: “VTinker: Guided Flow Upsampling and Texture Mapping for High-Resolution Video Frame In… ☆16 · Dec 5, 2025 · Updated 3 months ago
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× … ☆101 · Sep 8, 2025 · Updated 5 months ago
- A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training ☆659 · Updated this week
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance ☆86 · Sep 18, 2025 · Updated 5 months ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer ☆28 · Nov 4, 2025 · Updated 4 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning ☆31 · Jan 25, 2026 · Updated last month