A list of awesome papers on deep model compression and acceleration
☆350Jun 19, 2021Updated 4 years ago
Alternatives and similar repositories for awesome-model-compression-and-acceleration
Users that are interested in awesome-model-compression-and-acceleration are comparing it to the libraries listed below.
- ☆670Aug 25, 2021Updated 4 years ago
- Papers for deep neural network compression and acceleration☆401Jun 21, 2021Updated 4 years ago
- Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"☆336Jul 25, 2024Updated last year
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,909Mar 31, 2023Updated 3 years ago
- Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)☆1,089May 2, 2024Updated last year
- This is my final year project of Bachelor of Engineering. It's still incomplete though. I am trying to replicate the research paper "Deep …☆77Sep 21, 2017Updated 8 years ago
- Rethinking the Value of Network Pruning (PyTorch) (ICLR 2019)☆1,515Jun 7, 2020Updated 5 years ago
- A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…☆857Jun 19, 2021Updated 4 years ago
- collection of works aiming at reducing model sizes or the ASIC/FPGA accelerator for machine learning☆568Feb 3, 2024Updated 2 years ago
- Caffe for Deep Compression☆240Nov 17, 2017Updated 8 years ago
- PyTorch Implementation of [1611.06440] Pruning Convolutional Neural Networks for Resource Efficient Inference☆886Jul 12, 2019Updated 6 years ago
- Collection of recent methods on (deep) neural network compression and acceleration.☆954Apr 4, 2025Updated last year
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,548Aug 28, 2019Updated 6 years ago
- ShuffleNet-V2 for both PyTorch and Caffe.☆505Aug 9, 2018Updated 7 years ago
- Network acceleration methods☆178Jun 19, 2021Updated 4 years ago
- Implementation of model compression with knowledge distilling method.☆342Jan 3, 2017Updated 9 years ago
- Code and Pretrained model for IGCV3☆189Oct 22, 2018Updated 7 years ago
- Awesome Knowledge Distillation☆3,844Mar 22, 2026Updated 3 weeks ago
- Network Slimming (PyTorch) (ICCV 2017)☆919Nov 6, 2020Updated 5 years ago
- Deep learning model compression based on Keras☆32Aug 10, 2018Updated 7 years ago
- caffe model of ICCV'17 paper - ThiNet: A Filter Level Pruning Method for Deep Neural Network Compression https://arxiv.org/abs/1707.06342☆148Sep 19, 2018Updated 7 years ago
- Generate a quantization parameter file for ncnn framework int8 inference☆518Jul 29, 2020Updated 5 years ago
- Code for: "And the bit goes down: Revisiting the quantization of neural networks"☆631Nov 9, 2020Updated 5 years ago
- Learning both Weights and Connections for Efficient Neural Networks https://arxiv.org/abs/1506.02626☆182Nov 10, 2022Updated 3 years ago
- Soft Filter Pruning for Accelerating Deep Convolutional Neural Networks☆386Oct 2, 2019Updated 6 years ago
- A curated list of neural network pruning resources.☆2,492Apr 4, 2024Updated 2 years ago
- Reducing the size of convolutional neural networks☆113Nov 28, 2017Updated 8 years ago
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆170Feb 26, 2021Updated 5 years ago
- Codes for accepted paper "Cooperative Pruning in Cross-Domain Deep Neural Network Compression" in IJCAI 2019.☆12Aug 15, 2019Updated 6 years ago
- implementation for paper "ShelfNet for fast semantic segmentation"☆252Feb 27, 2021Updated 5 years ago
- Caffe for Sparse and Low-rank Deep Neural Networks☆382Mar 8, 2020Updated 6 years ago
- Pelee: A Real-Time Object Detection System on Mobile Devices☆885Jan 4, 2019Updated 7 years ago
- MetaPruning: Meta Learning for Automatic Neural Network Channel Pruning. In ICCV 2019.☆352Jul 5, 2020Updated 5 years ago
- PyTorch implementation of Wide Residual Networks with 1-bit weights by McDonnell (ICLR 2018)☆126Sep 6, 2018Updated 7 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆409Nov 22, 2022Updated 3 years ago
- Implementation for Trained Ternary Network.☆108Jan 13, 2017Updated 9 years ago
- Pretrained ConvNets for pytorch: NASNet, ResNeXt, ResNet, InceptionV4, InceptionResnetV2, Xception, DPN, etc.☆77Mar 29, 2018Updated 8 years ago
- Implementation of ICLR 2017 paper "Loss-aware Binarization of Deep Networks"☆20Feb 24, 2019Updated 7 years ago
- implementation of Iterative Pruning for Deep neural network [Han2015].☆40Jul 17, 2018Updated 7 years ago