Papers for deep neural network compression and acceleration
☆401Jun 21, 2021Updated 4 years ago
Alternatives and similar repositories for Model-Compression-Papers
Users that are interested in Model-Compression-Papers are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆669Aug 25, 2021Updated 4 years ago
- a list of awesome papers on deep model ompression and acceleration☆350Jun 19, 2021Updated 4 years ago
- Network acceleration methods☆178Jun 19, 2021Updated 4 years ago
- Awesome machine learning model compression research papers, quantization, tools, and learning material.☆544Sep 21, 2024Updated last year
- A curated list of neural network pruning resources.☆2,493Apr 4, 2024Updated 2 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Collection of recent methods on (deep) neural network compression and acceleration.☆954Apr 4, 2025Updated last year
- papers about model compression☆166Feb 10, 2023Updated 3 years ago
- Summary, Code for Deep Neural Network Quantization☆562May 13, 2026Updated last week
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,512Jun 7, 2020Updated 5 years ago
- A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…☆856Jun 19, 2021Updated 4 years ago
- An Automatic Model Compression (AutoMC) framework for developing smaller and faster AI applications.☆2,911Mar 31, 2023Updated 3 years ago
- knowledge distillation papers☆766Feb 10, 2023Updated 3 years ago
- collection of works aiming at reducing model sizes or the ASIC/FPGA accelerator for machine learning☆568Feb 3, 2024Updated 2 years ago
- PyTorch implementation of 'Deep Compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman Coding' by …☆427Feb 27, 2020Updated 6 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆450Nov 22, 2023Updated 2 years ago
- Compress neural network with pruning and quantization using TensorFlow.☆106Dec 19, 2018Updated 7 years ago
- Paper list on model compression and acceleration☆26Jun 4, 2019Updated 6 years ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,274May 6, 2025Updated last year
- Awesome Knowledge Distillation☆3,866Mar 22, 2026Updated 2 months ago
- Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)☆1,089May 2, 2024Updated 2 years ago
- Automated Deep Learning: Neural Architecture Search Is Not the End (a curated list of AutoDL resources and an in-depth analysis)☆2,336Sep 26, 2022Updated 3 years ago
- Slimmable Networks, AutoSlim, and Beyond, ICLR 2019, and ICCV 2019☆929Mar 9, 2023Updated 3 years ago
- Network Slimming (Pytorch) (ICCV 2017)☆919Nov 6, 2020Updated 5 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- [ECCV 2018] AMC: AutoML for Model Compression and Acceleration on Mobile Devices☆169Feb 26, 2021Updated 5 years ago
- Deep Compression on AlexNet☆672Mar 5, 2022Updated 4 years ago
- Filter Pruning via Geometric Median for Deep Convolutional Neural Networks Acceleration (CVPR 2019 Oral)☆617Aug 31, 2023Updated 2 years ago
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks☆245Aug 30, 2022Updated 3 years ago
- Contrast between ShuffleNet V2 and MnasNet.(Non-official implement In PyTorch)☆12Oct 25, 2018Updated 7 years ago
- Caffe for Sparse and Low-rank Deep Neural Networks☆382Mar 8, 2020Updated 6 years ago
- Count the MACs / FLOPs of your PyTorch model.☆5,084Jul 8, 2024Updated last year
- Awesome Computer Vision Resources☆85Feb 22, 2019Updated 7 years ago
- ☆54Feb 11, 2019Updated 7 years ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Repository to track the progress in model compression and acceleration☆107Jun 19, 2021Updated 4 years ago
- ☆1,516Aug 27, 2020Updated 5 years ago
- A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co…☆2,377May 11, 2026Updated last week
- Neural Network Quantization & Low-Bit Fixed Point Training For Hardware-Friendly Algorithm Design☆161Dec 18, 2020Updated 5 years ago
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,663May 30, 2023Updated 2 years ago
- Quantized Neural Network PACKage - mobile-optimized implementation of quantized neural network operators☆1,549Aug 28, 2019Updated 6 years ago
- Caffe/Neon prototxt training file for our Neurocomputing2017 work: Fuzzy Quantitative Deep Compression Network☆11May 30, 2018Updated 7 years ago