shawnricecake / draft-attention
Code for Draft Attention
☆99 · May 22, 2025 · Updated 8 months ago
Alternatives and similar repositories for draft-attention
Users that are interested in draft-attention are comparing it to the libraries listed below
- This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diff… ☆55 · May 21, 2025 · Updated 8 months ago
- [ICML2025, NeurIPS2025 Spotlight] Sparse VideoGen 1 & 2: Accelerating Video Diffusion Transformers with Sparse Attention ☆627 · Feb 3, 2026 · Updated last week
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv … ☆50 · Jun 6, 2025 · Updated 8 months ago
- Distributed parallel 3D-Causal-VAE for efficient training and inference ☆46 · Aug 20, 2025 · Updated 5 months ago
- Benchmark tests supporting the TiledCUDA library. ☆18 · Nov 19, 2024 · Updated last year
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring ☆269 · Jul 6, 2025 · Updated 7 months ago
- PyTorch implementation of the Flash Spectral Transform Unit. ☆21 · Sep 19, 2024 · Updated last year
- ☆33 · Jul 9, 2025 · Updated 7 months ago
- lite attention implemented over flash attention 3 ☆45 · Updated this week
- ☆32 · Jul 2, 2025 · Updated 7 months ago
- [ICCV2025] From Reusing to Forecasting: Accelerating Diffusion Models with TaylorSeers ☆365 · Aug 11, 2025 · Updated 6 months ago
- Enhancing Motion Dynamics of Image-to-Video Models via Adaptive Low-Pass Guidance (arXiv 2025) ☆54 · Jul 20, 2025 · Updated 6 months ago
- [NeurIPS 2025] Training-Free Efficient Video Generation via Dynamic Token Carving ☆272 · Aug 4, 2025 · Updated 6 months ago
- [ICCV 2025] Official repository of the paper "Beyond the Frame: Generating 360° Panoramic Videos from Perspective Videos" ☆36 · Feb 2, 2026 · Updated last week
- Frame Guidance: Training-Free Guidance for Frame-Level Control in Video Diffusion Model (ICLR 2026) ☆41 · Jul 10, 2025 · Updated 7 months ago
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching. ☆47 · Jul 17, 2025 · Updated 6 months ago
- [ICML2025] SpargeAttention: A training-free sparse attention that accelerates any model inference. ☆927 · Dec 31, 2025 · Updated last month
- ☆131 · May 29, 2025 · Updated 8 months ago
- E-GRPO: High Entropy Steps Drive Effective Reinforcement Learning for Flow Models ☆38 · Jan 5, 2026 · Updated last month
- Beyond KV Caching: Shared Attention for Efficient LLMs ☆20 · Jul 19, 2024 · Updated last year
- [ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time ☆322 · Oct 31, 2025 · Updated 3 months ago
- [ACL 2025] Squeezed Attention: Accelerating Long Prompt LLM Inference ☆56 · Nov 20, 2024 · Updated last year
- ☆14 · May 14, 2019 · Updated 6 years ago
- ☆10 · Nov 9, 2023 · Updated 2 years ago
- ☆190 · Jan 14, 2025 · Updated last year
- The simplest implementation of recent Sparse Attention patterns for efficient LLM inference. ☆92 · Jul 17, 2025 · Updated 6 months ago
- Learning to Skip the Middle Layers of Transformers ☆17 · Aug 7, 2025 · Updated 6 months ago
- Official Implementation of ACL2023: Don't Parse, Choose Spans! Continuous and Discontinuous Constituency Parsing via Autoregressive Span … ☆14 · Aug 25, 2023 · Updated 2 years ago
- ☆13 · Jan 7, 2025 · Updated last year
- diffusers with search engine ☆12 · Jan 13, 2026 · Updated last month
- ☆12 · Jan 29, 2021 · Updated 5 years ago
- An efficient distillation method for flow matching models ☆22 · Feb 1, 2026 · Updated last week
- A Distributed Attention Towards Linear Scalability for Ultra-Long Context, Heterogeneous Data Training ☆631 · Feb 6, 2026 · Updated last week
- [NeurIPS 2025] Official implementation of HiFlow: Training-free High-Resolution Image Generation with Flow-Aligned Guidance ☆84 · Sep 18, 2025 · Updated 4 months ago
- 🎬 3.7× faster video generation E2E 🖼️ 1.6× faster image generation E2E ⚡ ColumnSparseAttn 9.3× vs FlashAttn‑3 💨 ColumnSparseGEMM 2.5× … ☆101 · Sep 8, 2025 · Updated 5 months ago
- Open-sourcing code associated with the AAAI-25 paper "On the Expressiveness and Length Generalization of Selective State-Space Models on … ☆14 · Sep 18, 2025 · Updated 4 months ago
- Aligning Agentic World Models via Knowledgeable Experience Learning ☆28 · Jan 25, 2026 · Updated 2 weeks ago
- Geometry-Consistent Video Diffusion for Robotic Visual Policy Transfer ☆28 · Nov 4, 2025 · Updated 3 months ago
- A selective knowledge distillation algorithm for efficient speculative decoders ☆36 · Nov 27, 2025 · Updated 2 months ago