a list of awesome papers on deep model ompression and acceleration
☆350Jun 19, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-model-compression-and-acceleration
Users that are interested in awesome-model-compression-and-acceleration are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆668Aug 25, 2021Updated 4 years ago
- Papers for deep neural network compression and acceleration☆401Jun 21, 2021Updated 4 years ago
- Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"☆336Jul 25, 2024Updated last year
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,911Mar 31, 2023Updated 2 years ago
- Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)☆1,089May 2, 2024Updated last year
- NordVPN Threat Protection Pro™ • AdTake your cybersecurity to the next level. Block phishing, malware, trackers, and ads. Lightweight app that works with all browsers.
- This is my final year project of Bachelor of Engineering. Its still incomplete though. I am trying to replicate the research paper "Deep …☆77Sep 21, 2017Updated 8 years ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,517Jun 7, 2020Updated 5 years ago
- A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…☆856Jun 19, 2021Updated 4 years ago
- collection of works aiming at reducing model sizes or the ASIC/FPGA accelerator for machine learning☆568Feb 3, 2024Updated 2 years ago
- Caffe for Deep Compression☆240Nov 17, 2017Updated 8 years ago
- PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference☆887Jul 12, 2019Updated 6 years ago
- Collection of recent methods on (deep) neural network compression and acceleration.☆954Apr 4, 2025Updated 11 months ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,549Aug 28, 2019Updated 6 years ago
- ShuffleNet-V2 for both PyTorch and Caffe.☆505Aug 9, 2018Updated 7 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Network acceleration methods☆178Jun 19, 2021Updated 4 years ago
- Implementation of model compression with knowledge distilling method.☆342Jan 3, 2017Updated 9 years ago
- Code and Pretrained model for IGCV3☆189Oct 22, 2018Updated 7 years ago
- Awesome Knowledge Distillation☆3,826Updated this week
- Network Slimming (Pytorch) (ICCV 2017)☆919Nov 6, 2020Updated 5 years ago
- deep learning model compression based on keras☆32Aug 10, 2018Updated 7 years ago
- caffe model of ICCV'17 paper - ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression https://arxiv.org/abs/1707.06342☆148Sep 19, 2018Updated 7 years ago
- Generate a quantization parameter file for ncnn framework int8 inference☆518Jul 29, 2020Updated 5 years ago
- Code for: "And the bit goes down: Revisiting the quantization of neural networks"☆631Nov 9, 2020Updated 5 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626☆181Nov 10, 2022Updated 3 years ago
- Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks☆385Oct 2, 2019Updated 6 years ago
- A curated list of neural network pruning resources.☆2,491Apr 4, 2024Updated last year
- Reducing the size of convolutional neural networks☆113Nov 28, 2017Updated 8 years ago
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆169Feb 26, 2021Updated 5 years ago
- Codes for accepted paper "Cooperative Pruning in Cross-Domain Deep Neural Network Compression" in IJCAI 2019.☆12Aug 15, 2019Updated 6 years ago
- implementation for paper "ShelfNet for fast semantic segmentation"☆252Feb 27, 2021Updated 5 years ago
- Caffe for Sparse and Low-rank Deep Neural Networks☆382Mar 8, 2020Updated 6 years ago
- Pelee: A Real-Time Object Detection System on Mobile Devices☆886Jan 4, 2019Updated 7 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. In ICCV 2019.☆352Jul 5, 2020Updated 5 years ago
- PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)☆126Sep 6, 2018Updated 7 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆408Nov 22, 2022Updated 3 years ago
- Implementation for Trained Ternary Network.☆108Jan 13, 2017Updated 9 years ago
- Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.☆77Mar 29, 2018Updated 7 years ago
- Implementation of ICLR 2017 paper "Loss-aware Binarization of Deep Networks"☆20Feb 24, 2019Updated 7 years ago
- implementation of Iterative Pruning for Deep neural network [Han2015].☆40Jul 17, 2018Updated 7 years ago