hrcheng1066 / awesome-pruning

☆263

Alternatives and similar repositories for awesome-pruning

Users who are interested in awesome-pruning compare it to the repositories listed below.

ghimiredhikura / Awasome-Pruning
Awesome Papers and Resources in Deep Neural Network Pruning with Source Code.
☆160 Updated 10 months ago
Zhen-Dong / Awesome-Quantization-Papers
List of papers related to neural network quantization in recent AI conferences and journals.
☆669 Updated 3 months ago
xidongwu / AutoTrainOnce
☆17 Updated 9 months ago
IST-DASLab / OBC
Code for the NeurIPS 2022 paper "Optimal Brain Compression: A Framework for Accurate Post-Training Quantization and Pruning".
☆122 Updated 2 years ago
pprp / Awesome-LLM-Prune
Awesome list for LLM pruning.
☆241 Updated 7 months ago
Efficient-ML / Awesome-Efficient-AIGC
A list of papers, docs, and code about efficient AIGC. This repo aims to provide information for efficient AIGC research, including languag…
☆186 Updated 5 months ago
wimh966 / QDrop
The official PyTorch implementation of the ICLR 2022 paper, QDrop: Randomly Dropping Quantization for Extremely Low-bit Post-Training Quan…
☆122 Updated 2 years ago
hahnyuan / PTQ4ViT
Post-Training Quantization for Vision Transformers.
☆221 Updated 3 years ago
megvii-research / FQ-ViT
[IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer
☆344 Updated 2 years ago
WoosukKwon / retraining-free-pruning
[NeurIPS 2022] A Fast Post-Training Pruning Framework for Transformers
☆190 Updated 2 years ago
Hsu1023 / DuQuant
[NeurIPS 2024 Oral🔥] DuQuant: Distributing Outliers via Dual Transformation Makes Stronger Quantized LLMs.
☆164 Updated 9 months ago
lihuantong / HAST
☆12 Updated last year
yhhhli / BRECQ
PyTorch implementation of BRECQ, ICLR 2021
☆277 Updated 3 years ago
liyunqianggyn / Awesome-LLMs-Pruning
An all-in-one repository of awesome LLM pruning papers, integrating useful resources and insights.
☆96 Updated 7 months ago
biomedical-cybernetics / Relative-importance-and-activation-pruning
☆48 Updated last year
tianyic / only_train_once_personal_footprint
OTOv1-v3, NeurIPS, ICLR, TMLR, DNN Training, Compression, Structured Pruning, Erasing Operators, CNN, Diffusion, LLM
☆305 Updated 10 months ago
liuzechun / Nonuniform-to-Uniform-Quantization
Nonuniform-to-Uniform Quantization: Towards Accurate Quantization via Generalized Straight-Through Estimation. In CVPR 2022.
☆133 Updated 3 years ago
pprp / Awesome-LLM-Quantization
Awesome list for LLM quantization.
☆252 Updated last month
xuke225 / EQ-Net
EQ-Net [ICCV 2023]
☆29 Updated last year
DD-DuDa / awesome-vit-quantization-acceleration
List of papers related to Vision Transformers quantization and hardware acceleration in recent AI conferences and journals.
☆92 Updated last year
YanjingLi0202 / Q-ViT
The official implementation of the NeurIPS 2022 paper Q-ViT.
☆96 Updated 2 years ago
zejiangh / Filter-GaP
The official PyTorch implementation of CHEX: CHannel EXploration for CNN Model Compression (CVPR 2022). Paper is available at https://ope…
☆38 Updated 3 years ago
falcon-xu / early-exit-papers
A curated list of early-exiting papers (LLM, CV, NLP, etc.)
☆57 Updated 10 months ago
Beryex / RLPruner-CNN
RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration
☆20 Updated last month
BrotherHappy / OSTQuant
[ICLR 2025] OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitt…
☆68 Updated 3 months ago
NVlabs / NViT
☆21 Updated last year
hustvl / PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆56 Updated 2 years ago
xvyaward / owq
Code for the AAAI 2024 Oral paper "OWQ: Outlier-Aware Weight Quantization for Efficient Fine-Tuning and Inference of Large Language Model…
☆63 Updated last year
zhutmost / lsq-net
Unofficial implementation of LSQ-Net, a neural network quantization framework
☆298 Updated last year
Qualcomm-AI-research / pruning-vs-quantization
☆22 Updated last year