oahzxl / Awesome-Efficient-Video-Generation
A curated list of recent efficient video generation methods.
☆21Updated this week
Alternatives and similar repositories for Awesome-Efficient-Video-Generation
Users that are interested in Awesome-Efficient-Video-Generation are comparing it to the libraries listed below
- A WebUI for Side-by-Side Comparison of Media (Images/Videos) Across Multiple Folders☆23Updated 6 months ago
- [ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality…☆52Updated 5 months ago
- [NeurIPS 2024] Learning-to-Cache: Accelerating Diffusion Transformer via Layer Caching☆111Updated last year
- ☆90Updated 3 months ago
- Code for Draft Attention☆90Updated 3 months ago
- [ICML2025] LoRA fine-tune directly on the quantized models.☆35Updated 9 months ago
- Triton implementation of bi-directional (non-causal) linear attention☆51Updated 6 months ago
- An open-source implementation of Regional Adaptive Sampling (RAS), a novel diffusion model sampling strategy that introduces regional var…☆138Updated 2 months ago
- [ECCV24] MixDQ: Memory-Efficient Few-Step Text-to-Image Diffusion Models with Metric-Decoupled Mixed Precision Quantization☆14Updated 9 months ago
- VidKV: Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models☆22Updated 5 months ago
- ScaleKV: Memory-Efficient Visual Autoregressive Modeling with Scale-Aware KV Cache Compression☆46Updated 2 months ago
- Curated list of methods that focus on improving the efficiency of diffusion models☆46Updated last year
- Official implementation of paper "VMoBA: Mixture-of-Block Attention for Video Diffusion Models"☆41Updated last month
- ☆16Updated last year
- [ICML 2025] SparseLoRA: Accelerating LLM Fine-Tuning with Contextual Sparsity☆51Updated last month
- Paper survey of efficient computation for large scale models.☆34Updated 8 months ago
- [ICML24] Pruner-Zero: Evolving Symbolic Pruning Metric from scratch for LLMs☆92Updated 9 months ago
- TerDiT: Ternary Diffusion Models with Transformers☆71Updated last year
- ☆14Updated 5 months ago
- [ICML 2025] XAttention: Block Sparse Attention with Antidiagonal Scoring☆219Updated last month
- [ICLR 2025] Official PyTorch implementation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…☆103Updated last year
- [CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers☆56Updated 11 months ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di…☆64Updated last year
- Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation☆67Updated last month
- FORA introduces a simple yet effective caching mechanism in Diffusion Transformer Architecture for faster inference sampling.☆48Updated last year
- ☆173Updated 7 months ago
- [ICML 2024 Oral] This project is the official implementation of our paper "Accurate LoRA-Finetuning Quantization of LLMs via Information Retenti…☆67Updated last year
- ☆33Updated 4 months ago
- The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models☆100Updated last year
- [ICCV 2025] QuEST: Efficient Finetuning for Low-bit Diffusion Models☆52Updated 2 months ago