a list of awesome papers on deep model ompression and acceleration
☆350Jun 19, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-model-compression-and-acceleration
Users that are interested in awesome-model-compression-and-acceleration are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆669Aug 25, 2021Updated 4 years ago
- Papers for deep neural network compression and acceleration☆401Jun 21, 2021Updated 4 years ago
- Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"☆336Jul 25, 2024Updated last year
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,913Mar 31, 2023Updated 3 years ago
- Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)☆1,089May 2, 2024Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- This is my final year project of Bachelor of Engineering. Its still incomplete though. I am trying to replicate the research paper "Deep …☆77Sep 21, 2017Updated 8 years ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,512Jun 7, 2020Updated 6 years ago
- A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…☆855Jun 19, 2021Updated 4 years ago
- collection of works aiming at reducing model sizes or the ASIC/FPGA accelerator for machine learning☆568Feb 3, 2024Updated 2 years ago
- Caffe for Deep Compression☆239Nov 17, 2017Updated 8 years ago
- PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference☆885Jul 12, 2019Updated 6 years ago
- Collection of recent methods on (deep) neural network compression and acceleration.☆955Apr 4, 2025Updated last year
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,549Aug 28, 2019Updated 6 years ago
- ShuffleNet-V2 for both PyTorch and Caffe.☆504Aug 9, 2018Updated 7 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Network acceleration methods☆178Jun 19, 2021Updated 4 years ago
- Implementation of model compression with knowledge distilling method.☆342Jan 3, 2017Updated 9 years ago
- Code and Pretrained model for IGCV3☆189Oct 22, 2018Updated 7 years ago
- Awesome Knowledge Distillation☆3,881May 25, 2026Updated 3 weeks ago
- Network Slimming (Pytorch) (ICCV 2017)☆920Nov 6, 2020Updated 5 years ago
- deep learning model compression based on keras☆32Aug 10, 2018Updated 7 years ago
- caffe model of ICCV'17 paper - ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression https://arxiv.org/abs/1707.06342☆149Sep 19, 2018Updated 7 years ago
- Generate a quantization parameter file for ncnn framework int8 inference☆517Jul 29, 2020Updated 5 years ago
- Code for: "And the bit goes down: Revisiting the quantization of neural networks"☆630Nov 9, 2020Updated 5 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626☆182Nov 10, 2022Updated 3 years ago
- Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks☆385Oct 2, 2019Updated 6 years ago
- A curated list of neural network pruning resources.☆2,494Apr 4, 2024Updated 2 years ago
- Reducing the size of convolutional neural networks☆114Nov 28, 2017Updated 8 years ago
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆169Feb 26, 2021Updated 5 years ago
- Codes for accepted paper "Cooperative Pruning in Cross-Domain Deep Neural Network Compression" in IJCAI 2019.☆12Aug 15, 2019Updated 6 years ago
- implementation for paper "ShelfNet for fast semantic segmentation"☆252Feb 27, 2021Updated 5 years ago
- Caffe for Sparse and Low-rank Deep Neural Networks☆382Mar 8, 2020Updated 6 years ago
- Pelee: A Real-Time Object Detection System on Mobile Devices☆883Jan 4, 2019Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. In ICCV 2019.☆352Jul 5, 2020Updated 5 years ago
- PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)☆126Sep 6, 2018Updated 7 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆407Nov 22, 2022Updated 3 years ago
- Implementation for Trained Ternary Network.☆108Jan 13, 2017Updated 9 years ago
- Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.☆76Mar 29, 2018Updated 8 years ago
- Implementation of ICLR 2017 paper "Loss-aware Binarization of Deep Networks"☆20Feb 24, 2019Updated 7 years ago
- [ICLR 2019] ProxylessNAS: Direct Neural Architecture Search on Target Task and Hardware☆1,447Aug 30, 2024Updated last year