moonmath-ai / LiteAttentionLinks
lite attention implemented over flash attention 3
☆32Updated this week
Alternatives and similar repositories for LiteAttention
Users that are interested in LiteAttention are comparing it to the libraries listed below
Sorting:
- (NeurIPS 2024) BiDM: Pushing the Limit of Quantization for Diffusion Models☆22Updated last year
- Official implementation of DiCache: Let Diffusion Model Determine Its Own Cache☆52Updated last month
- A curated list of research papers, resources, and advancements on Diffusion Cache and related efficient diffusion model acceleration tech…☆56Updated 3 weeks ago
- Implementation of SmoothCache, a project aimed at speeding-up Diffusion Transformer (DiT) based GenAI models with error-guided caching.☆46Updated 4 months ago
- [NeurIPS 2025] ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆51Updated 3 weeks ago
- The official implementation of "Sparse-vDiT: Unleashing the Power of Sparse Attention to Accelerate Video Diffusion Transformers" (arXiv …☆49Updated 5 months ago
- This repository includes the official implementation of our paper "Grouping First, Attending Smartly: Training-Free Acceleration for Diff…☆53Updated 6 months ago
- CutDiffusion: A Simple, Fast, Cheap, and Strong Diffusion Extrapolation Method☆27Updated last month
- Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model☆11Updated 11 months ago
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆40Updated 8 months ago
- (ICCV2025) EEdit⚡: Rethinking the Spatial and Temporal Redundancy for Efficient Image Editing☆58Updated 2 months ago
- Code for Draft Attention☆93Updated 6 months ago
- [Arxiv 2025] SparseD: Sparse Attention for Diffusion Language Models☆49Updated last month
- An official implementation of EvoSearch: Scaling Image and Video Generation via Test-Time Evolutionary Search☆99Updated last month
- ☆129Updated 5 months ago
- Code for our ICCV 2025 paper "Adaptive Caching for Faster Video Generation with Diffusion Transformers"☆161Updated last year
- ☆73Updated last month
- Multimodal Representation Alignment for Image Generation: Text-Image Interleaved Control Is Easier Than You Think!☆120Updated 8 months ago
- (ToCa-v2) A New version of ToCa,with faster speed and better acceleration!☆39Updated 8 months ago
- [NeurIPS 2025] IEAP: Image Editing As Programs with Diffusion Models☆107Updated 2 months ago
- (CVPR 2025) Scailing Down Text Encoders of Text-to-Image Diffusion Models☆48Updated 2 months ago
- ☆77Updated 2 months ago
- [Preprint] Efficient Generative Model Training via Embedded Representation Warmup☆36Updated last month
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆55Updated 4 months ago
- A light-weight and high-efficient training framework for accelerating diffusion tasks.☆50Updated last year
- ☆11Updated last year
- Official Implementation of "LeX-Art: Rethinking Text Generation via Scalable High-Quality Data Synthesis"☆73Updated 3 months ago
- Transition Models☆133Updated last month
- [ICML 2025] Official PyTorch implementation of paper "Ultra-Resolution Adaptation with Ease".☆114Updated 6 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆116Updated last year