Efficient-ML / Awesome-Efficient-AIGC
A list of papers, docs, and code about efficient AIGC. This repo aims to provide resources for efficient AIGC research, covering both language and vision, and is continuously being improved. PRs adding works (papers, repositories) missed by the repo are welcome.
☆177 · Updated 2 months ago
Alternatives and similar repositories for Awesome-Efficient-AIGC:
Users interested in Awesome-Efficient-AIGC are comparing it to the repositories listed below
- [NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs. ☆157 · Updated 6 months ago
- Code Repository of Evaluating Quantized Large Language Models ☆121 · Updated 7 months ago
- [ICLR'25] ViDiT-Q: Efficient and Accurate Quantization of Diffusion Transformers for Image and Video Generation ☆77 · Updated last month
- [CVPR 2024 Highlight] This is the official PyTorch implementation of "TFMQ-DM: Temporal Feature Maintenance Quantization for Diffusion Mo… ☆62 · Updated 8 months ago
- Awesome list for LLM quantization ☆201 · Updated 4 months ago
- Awesome list for LLM pruning. ☆222 · Updated 4 months ago
- The official PyTorch implementation of the ICLR2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan… ☆119 · Updated last year
- The official implementation of PTQD: Accurate Post-Training Quantization for Diffusion Models ☆99 · Updated last year
- PyTorch implementation of PTQ4DiT https://arxiv.org/abs/2405.16005 ☆27 · Updated 5 months ago
- An algorithm for weight-activation quantization (W4A4, W4A8) of LLMs, supporting both static and dynamic quantization ☆127 · Updated 2 months ago
- Awesome Papers and Resources in Deep Neural Network Pruning with Source Code. ☆156 · Updated 7 months ago
- Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning". ☆118 · Updated last year
- An all-in-one repository of awesome LLM pruning papers, integrating useful resources and insights. ☆84 · Updated 4 months ago
- Implementation of Post-training Quantization on Diffusion Models (CVPR 2023) ☆136 · Updated 2 years ago
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar… ☆55 · Updated last year
- [ICLR 2025] COAT: Compressing Optimizer States and Activation for Memory-Efficient FP8 Training ☆184 · Updated last week
- Official PyTorch implementation of "FlatQuant: Flatness Matters for LLM Quantization" ☆120 · Updated 3 weeks ago
- This repository contains integer operators on GPUs for PyTorch. ☆202 · Updated last year
- QuEST: Efficient Finetuning for Low-bit Diffusion Models ☆41 · Updated 3 months ago
- D^2-MoE: Delta Decompression for MoE-based LLMs Compression ☆41 · Updated last month
- [CVPR 2025] Q-DiT: Accurate Post-Training Quantization for Diffusion Transformers ☆47 · Updated 7 months ago
- A sparse attention kernel supporting mixed sparse patterns ☆197 · Updated 2 months ago
- [ICLR2025]: OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt… ☆46 · Updated 2 weeks ago
- [ICLR 2024 Spotlight] This is the official PyTorch implementation of "EfficientDM: Efficient Quantization-Aware Fine-Tuning of Low-Bit Di… ☆59 · Updated 10 months ago
- A curated list of Awesome Diffusion Inference Papers with codes: Sampling, Caching, Multi-GPUs, etc. ☆210 · Updated last month
- [ICML 2024 Oral] Any-Precision LLM: Low-Cost Deployment of Multiple, Different-Sized LLMs ☆104 · Updated last week
- Collection of awesome generation acceleration resources. ☆215 · Updated this week
- Official implementation of the EMNLP23 paper: Outlier Suppression+: Accurate quantization of large language models by equivalent and opti… ☆47 · Updated last year
- PyTorch code for our paper "ARB-LLM: Alternating Refined Binarizations for Large Language Models" ☆24 · Updated last month
- List of papers related to neural network quantization in recent AI conferences and journals. ☆597 · Updated 3 weeks ago
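Several repositories above concern weight-activation quantization (e.g. W4A4/W4A8) in static and dynamic variants. As a rough illustration only, and not the implementation used by any repo listed here, a minimal symmetric per-tensor quantizer might look like this (the function names `quantize_symmetric` and `dequantize` are hypothetical):

```python
import numpy as np

def quantize_symmetric(x, bits=4, scale=None):
    # Symmetric uniform quantization to signed integers of the given width.
    qmax = 2 ** (bits - 1) - 1
    if scale is None:
        # Dynamic quantization: derive the scale from this tensor at run time.
        scale = np.abs(x).max() / qmax
    q = np.clip(np.round(x / scale), -qmax - 1, qmax)
    return q.astype(np.int8), scale

def dequantize(q, scale):
    # Map the integer codes back to approximate floating-point values.
    return q.astype(np.float32) * scale

# W4: 4-bit weights; in a static scheme the scale would be fixed offline
# from calibration data, here we just compute it from the tensor itself.
weights = np.array([0.5, -1.2, 0.03, 0.9], dtype=np.float32)
w_q, w_scale = quantize_symmetric(weights, bits=4)

# A4: 4-bit activations, quantized dynamically per input tensor.
acts = np.array([2.0, -0.7, 1.1, 0.2], dtype=np.float32)
a_q, a_scale = quantize_symmetric(acts, bits=4)

recovered = dequantize(w_q, w_scale)  # approximate reconstruction of weights
```

The static/dynamic distinction is only in where `scale` comes from: precomputed on calibration data (static) versus recomputed per tensor at inference time (dynamic); the papers in the list differ mainly in how they pick and transform these scales to handle outliers.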