ThisisBillhe/ZipAR

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/ThisisBillhe/ZipAR)

ThisisBillhe / ZipAR

[ICML 2025] This is the official PyTorch implementation of "ZipAR: Accelerating Auto-regressive Image Generation through Spatial Locality"

☆51

Alternatives and similar repositories for ZipAR

Users that are interested in ZipAR are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

ThisisBillhe / ZipCache
View on GitHub
[NeurIPS 2024] The official implementation of ZipCache: Accurate and Efficient KV Cache Quantization with Salient Token Identification
☆33Mar 30, 2025Updated last year
ThisisBillhe / NAR
View on GitHub
[ICCV 2025] The official implementation of "Neighboring Autoregressive Modeling for Efficient Visual Generation"
☆62Apr 5, 2025Updated last year
alibaba-damo-academy / K-Forcing
View on GitHub
Official implementation for "K-Forcing: Joint Next-K-Token Decoding via Push-Forward Language Modeling"
☆16Jun 14, 2026Updated last month
ThisisBillhe / torch_quantizer
View on GitHub
torch_quantizer is a out-of-box quantization tool for PyTorch models on CUDA backend, specially optimized for Diffusion Models.
☆25Mar 29, 2024Updated 2 years ago
Chenfeng1271 / SVDiff
View on GitHub
Streaming Video Diffusion: Online Video Editing with Diffusion Models
☆17Jun 3, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ziplab / efficient-stable-diffusion
View on GitHub
☆16Sep 12, 2023Updated 2 years ago
A-suozhang / ViDiT-Q
View on GitHub
☆15Mar 21, 2025Updated last year
ziplab / PTQD
View on GitHub
The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models
☆103Mar 12, 2024Updated 2 years ago
NVlabs / T-Stitch
View on GitHub
[ICLR 2025] Official PyTorch implmentation of paper "T-Stitch: Accelerating Sampling in Pre-trained Diffusion Models with Trajectory Stit…
☆107Feb 26, 2024Updated 2 years ago
maxin-cn / Awesome-Autoregressive-Visual-Generation-Models
View on GitHub
a collection of awesome autoregressive visual generation models
☆82Apr 17, 2025Updated last year
czg1225 / CoDe
View on GitHub
[CVPR 2025] CoDe: Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient
☆108Sep 27, 2025Updated 9 months ago
alibaba-damo-academy / WorldOlympiad
View on GitHub
WorldOlympiad: Can Your World Model Survive a Triathlon?
☆54Updated this week
ziplab / Pyramid-Sparse-Attention
View on GitHub
Official PyTorch implementation of [PSA: Pyramid Sparse Attention for Efficient Video Understanding and Generation](https://arxiv.org/abs…
☆25Jan 25, 2026Updated 5 months ago
hywang66 / LARP
View on GitHub
Official Pytorch implementation for LARP: Tokenizing Videos with a Learned Autoregressive Generative Prior (ICLR 2025 Oral).
☆107Feb 11, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
A-suozhang / Awesome-Efficient-Diffusion
View on GitHub
Curated list of methods that focuses on improving the efficiency of diffusion models
☆43Jul 9, 2024Updated 2 years ago
ilur98 / DGQ
View on GitHub
Official Code For Dual Grained Quantization: Efficient Fine-Grained Quantization for LLM
☆14Dec 27, 2023Updated 2 years ago
Huage001 / LinFusion
View on GitHub
Official PyTorch and Diffusers Implementation of "LinFusion: 1 GPU, 1 Minute, 16K Image"
☆317Dec 23, 2024Updated last year
ThisisBillhe / BiViT
View on GitHub
The official implementation of BiViT: Extremely Compressed Binary Vision Transformers
☆16Jun 18, 2023Updated 3 years ago
DengZeshuai / SRTTA
View on GitHub
☆37Jan 25, 2024Updated 2 years ago
AI-Efficiency / Awesome-Efficient-AIGC
View on GitHub
A list of papers, docs, codes about efficient AIGC. This repo is aimed to provide the info for efficient AIGC research, including languag…
☆206Feb 10, 2025Updated last year
thu-nics / ViDiT-Q
View on GitHub
[ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation
☆163Mar 21, 2025Updated last year
ziplab / CoV
View on GitHub
[ACL 2026 Findings] CoV: Chain-of-View Prompting for Spatial Reasoning
☆63Apr 7, 2026Updated 3 months ago
TZW1998 / ParaTAA-Diffusion
View on GitHub
This is the official repo for the paper "Accelerating Parallel Sampling of Diffusion Models" Tang et al. ICML 2024 https://openreview.net…
☆16Jul 19, 2024Updated 2 years ago
End-to-end encrypted email - Proton Mail • Ad
Special offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
thu-nics / DiTFastAttn
View on GitHub
☆192Jan 14, 2025Updated last year
AkideLiu / MiniCache
View on GitHub
☆14Sep 7, 2024Updated last year
zysxmu / DFSQ
View on GitHub
super-resolution; post-training quantization; model compression
☆14Nov 10, 2023Updated 2 years ago
THUNLP-MT / ActiView
View on GitHub
☆11Dec 20, 2024Updated last year
thu-nics / MoA
View on GitHub
[CoLM'25] The official implementation of the paper <MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression>
☆159Jan 14, 2026Updated 6 months ago
MarkXCloud / CSpD
View on GitHub
The official repo of continuous speculative decoding
☆36Mar 28, 2025Updated last year
hatchetProject / QuEST
View on GitHub
[ICCV 2025] QuEST: Efficient Finetuning for Low-bit Diffusion Models
☆60Jun 26, 2025Updated last year
Alpha-VLLM / Lumina-mGPT
View on GitHub
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraini…
☆646Oct 16, 2025Updated 9 months ago
ziplab / BLADE
View on GitHub
[ICLR 2026] This is the official PyTorch implementation of "BLADE: Block-Sparse Attention Meets Step Distillation for Efficient Video Gen…
☆49Oct 9, 2025Updated 9 months ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
HubHop / vit-attention-benchmark
View on GitHub
Benchmarking Attention Mechanism in Vision Transformers.
☆20Oct 10, 2022Updated 3 years ago
42Shawn / PTQ4DM
View on GitHub
Implementation of Post-training Quantization on Diffusion Models (CVPR 2023)
☆146Apr 1, 2023Updated 3 years ago
ModelTC / QLLM
View on GitHub
[ICLR 2024] This is the official PyTorch implementation of "QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Mod…
☆39Mar 11, 2024Updated 2 years ago
ali-vilab / ProMoE
View on GitHub
[ICLR2026] The official code of "Routing Matters in MoE: Scaling Diffusion Transformers with Explicit Routing Guidance"
☆46Mar 23, 2026Updated 3 months ago
mit-han-lab / lpd
View on GitHub
[ICLR 2026 Oral] Locality-aware Parallel Decoding for Efficient Autoregressive Image Generation
☆104May 8, 2026Updated 2 months ago
SciMT / SciMT-benchmark
View on GitHub
☆11Jan 3, 2024Updated 2 years ago
xuyang-liu16 / Awesome-Generation-Acceleration
View on GitHub
📚 Collection of awesome generation acceleration resources.
☆401Jul 7, 2025Updated last year