sun254/awesome-model-compression-and-acceleration

sun254 / awesome-model-compression-and-acceleration

A list of awesome papers on deep model compression and acceleration

☆348

Alternatives and similar repositories for awesome-model-compression-and-acceleration

Users that are interested in awesome-model-compression-and-acceleration are comparing it to the libraries listed below.

memoiry / Awesome-model-compression-and-acceleration
☆666 · Aug 25, 2021 · Updated 4 years ago
chester256 / Model-Compression-Papers
Papers for deep neural network compression and acceleration
☆402 · Jun 21, 2021 · Updated 5 years ago
antspy / quantized_distillation
Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"
☆336 · Jul 25, 2024 · Updated 2 years ago
Tencent / PocketFlow
An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.
☆2,909 · Mar 31, 2023 · Updated 3 years ago
ethanhe42 / channel-pruning
Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)
☆1,089 · May 2, 2024 · Updated 2 years ago
hiteshvaidya / Model-Compression
This is my final year project of Bachelor of Engineering. It's still incomplete though. I am trying to replicate the research paper "Deep …
☆77 · Sep 21, 2017 · Updated 8 years ago
Eric-mingjie / rethinking-network-pruning
Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)
☆1,512 · Jun 7, 2020 · Updated 6 years ago
guan-yuan / Awesome-AutoML-and-Lightweight-Models
A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…
☆856 · Jun 19, 2021 · Updated 5 years ago
zssloth / Embedded-Neural-Network
collection of works aiming at reducing model sizes or the ASIC/FPGA accelerator for machine learning
☆568 · Feb 3, 2024 · Updated 2 years ago
may0324 / DeepCompression-caffe
Caffe for Deep Compression
☆238 · Nov 17, 2017 · Updated 8 years ago
jacobgil / pytorch-pruning
PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference
☆885 · Jul 12, 2019 · Updated 7 years ago
pytorch / QNNPACK
Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators
☆1,550 · Aug 28, 2019 · Updated 6 years ago
MingSun-Tse / Efficient-Deep-Learning
Collection of recent methods on (deep) neural network compression and acceleration.
☆956 · Apr 4, 2025 · Updated last year
miaow1988 / ShuffleNet_V2_pytorch_caffe
ShuffleNet-V2 for both PyTorch and Caffe.
☆504 · Aug 9, 2018 · Updated 7 years ago
mrgloom / Network-Speed-and-Compression
Network acceleration methods
☆178 · Jun 19, 2021 · Updated 5 years ago
chengshengchan / model_compression
Implementation of model compression with knowledge distilling method.
☆342 · Jan 3, 2017 · Updated 9 years ago
homles11 / IGCV3
Code and Pretrained model for IGCV3
☆189 · Oct 22, 2018 · Updated 7 years ago
dkozlov / awesome-knowledge-distillation
Awesome Knowledge Distillation
☆3,890 · May 25, 2026 · Updated 2 months ago
Eric-mingjie / network-slimming
Network Slimming (Pytorch) (ICCV 2017)
☆919 · Nov 6, 2020 · Updated 5 years ago
guoxiaolu / model_compression
deep learning model compression based on keras
☆32 · Aug 10, 2018 · Updated 7 years ago
Roll920 / ThiNet
caffe model of ICCV'17 paper - ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression https://arxiv.org/abs/1707.06342
☆149 · Sep 19, 2018 · Updated 7 years ago
BUG1989 / caffe-int8-convert-tools
Generate a quantization parameter file for ncnn framework int8 inference
☆517 · Jul 29, 2020 · Updated 5 years ago
facebookresearch / kill-the-bits
Code for: "And the bit goes down: Revisiting the quantization of neural networks"
☆630 · Nov 9, 2020 · Updated 5 years ago
jack-willturner / deep-compression
Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626
☆182 · Nov 10, 2022 · Updated 3 years ago
he-y / soft-filter-pruning
Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks
☆386 · Oct 2, 2019 · Updated 6 years ago
he-y / Awesome-Pruning
A curated list of neural network pruning resources.
☆2,496 · Apr 4, 2024 · Updated 2 years ago
TropComplique / trained-ternary-quantization
Reducing the size of convolutional neural networks
☆114 · Nov 28, 2017 · Updated 8 years ago
mit-han-lab / amc-models
[ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices
☆168 · Feb 26, 2021 · Updated 5 years ago
taoyizhi68 / py-data-augmentation
python image data augmentation
☆12 · Jul 24, 2017 · Updated 9 years ago
csyhhu / Co-Prune
Codes for accepted paper "Cooperative Pruning in Cross-Domain Deep Neural Network Compression" in IJCAI 2019.
☆12 · Aug 15, 2019 · Updated 6 years ago
juntang-zhuang / ShelfNet
implementation for paper "ShelfNet for fast semantic segmentation"
☆252 · Feb 27, 2021 · Updated 5 years ago
wenwei202 / caffe
Caffe for Sparse and Low-rank Deep Neural Networks
☆382 · Mar 8, 2020 · Updated 6 years ago
Robert-JunWang / Pelee
Pelee: A Real-Time Object Detection System on Mobile Devices
☆884 · Jan 4, 2019 · Updated 7 years ago
szagoruyko / binary-wide-resnet
PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)
☆126 · Sep 6, 2018 · Updated 7 years ago
liuzechun / MetaPruning
MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. In ICCV 2019.
☆352 · Jul 5, 2020 · Updated 6 years ago
deepglint / EasyQuant
EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…
☆407 · Nov 22, 2022 · Updated 3 years ago
veronikayurchuk / pretrained-models.pytorch
Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.
☆76 · Mar 29, 2018 · Updated 8 years ago
mit-han-lab / proxylessnas
[ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware
☆1,446 · Aug 30, 2024 · Updated last year
houlu369 / Loss-aware-Binarization
Implementation of ICLR 2017 paper "Loss-aware Binarization of Deep Networks"
☆20 · Feb 24, 2019 · Updated 7 years ago