xuyang-liu16 / Awesome-Generation-Acceleration
Collection of awesome generation acceleration resources.
☆155 Updated last week
Alternatives and similar repositories for Awesome-Generation-Acceleration:
Users who are interested in Awesome-Generation-Acceleration are comparing it to the libraries listed below:
- Accelerating Diffusion Transformers with Token-wise Feature Caching ☆80 Updated last week
- ☆148 Updated last month
- Adaptive Caching for Faster Video Generation with Diffusion Transformers ☆142 Updated 3 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching ☆94 Updated 7 months ago
- [ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation ☆55 Updated last week
- This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality" ☆45 Updated last month
- This is a repo to track the latest autoregressive visual generation papers. ☆150 Updated this week
- SpeeD: A Closer Look at Time Steps is Worthy of Triple Speed-Up for Diffusion Model Training ☆176 Updated last month
- [CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Mo… ☆60 Updated 7 months ago
- [ICLR 2025] VILA-U: a Unified Foundation Model Integrating Visual Understanding and Generation ☆232 Updated last month
- HART: Efficient Visual Generation with Hybrid Autoregressive Transformer ☆420 Updated 4 months ago
- Collection of token reduction for model compression resources. ☆35 Updated 2 weeks ago
- FORA introduces a simple yet effective caching mechanism in the Diffusion Transformer architecture for faster inference sampling. ☆38 Updated 7 months ago
- Scaling Diffusion Transformers with Mixture of Experts ☆275 Updated 5 months ago
- [ICLR 2025] OpenVid-1M: A Large-Scale High-Quality Dataset for Text-to-video Generation ☆248 Updated this week
- [CVPR 2025] 🔥 Official impl. of "TokenFlow: Unified Image Tokenizer for Multimodal Understanding and Generation". ☆273 Updated this week
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023) ☆130 Updated last year
- The paper collections for the autoregressive models in vision. ☆422 Updated this week
- [CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient ☆79 Updated this week
- ☆56 Updated last month
- Accelerating Vision Diffusion Transformers with Skip Branches. ☆60 Updated 2 months ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di… ☆58 Updated 9 months ago
- [ICLR25] High-performance Image Tokenizers for VAR and AR ☆206 Updated 2 weeks ago
- The collection of awesome papers on alignment of diffusion models. ☆119 Updated 2 weeks ago
- Official implementation of paper "SparseVLM: Visual Token Sparsification for Efficient Vision-Language Model Inference". ☆77 Updated 4 months ago
- [NeurIPS 2024 Oral 🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs. ☆143 Updated 5 months ago
- [CVPR 2025] TinyFusion: Diffusion Transformers Learned Shallow ☆80 Updated 2 months ago
- LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models ☆116 Updated 9 months ago