papers about model compression
☆166Feb 10, 2023Updated 3 years ago
Alternatives and similar repositories for awesome-model-compression
Users that are interested in awesome-model-compression are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Network acceleration methods☆178Jun 19, 2021Updated 4 years ago
- Awesome machine learning model compression research papers, quantization, tools, and learning material.☆539Sep 21, 2024Updated last year
- Papers for deep neural network compression and acceleration☆401Jun 21, 2021Updated 4 years ago
- ☆668Aug 25, 2021Updated 4 years ago
- Repository to track the progress in model compression and acceleration☆106Jun 19, 2021Updated 4 years ago
- A curated list of neural network pruning resources.☆2,491Apr 4, 2024Updated last year
- Collection of recent methods on (deep) neural network compression and acceleration.☆954Apr 4, 2025Updated 11 months ago
- A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…☆856Jun 19, 2021Updated 4 years ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,517Jun 7, 2020Updated 5 years ago
- Implementations of the XNOR networks☆12Aug 9, 2017Updated 8 years ago
- PyTorch Model Compression☆234Jan 27, 2023Updated 3 years ago
- Summary, Code for Deep Neural Network Quantization☆559Jun 14, 2025Updated 9 months ago
- Implements quantized distillation. Code for our paper "Model compression via distillation and quantization"☆336Jul 25, 2024Updated last year
- Class Hierarchy with CTags for Sublime Text 2☆20Apr 29, 2014Updated 11 years ago
- Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)☆1,088May 2, 2024Updated last year
- ☆26Apr 12, 2022Updated 3 years ago
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks☆245Aug 30, 2022Updated 3 years ago
- implement of DoReFaNet with tensorflow based on cifar10 dataset☆28Nov 8, 2017Updated 8 years ago
- Reproduction of WAGE in PyTorch.☆44Nov 18, 2018Updated 7 years ago
- Single-Path NAS: Designing Hardware-Efficient ConvNets in less than 4 Hours☆395Dec 14, 2020Updated 5 years ago
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,657May 30, 2023Updated 2 years ago
- This repo is about NAS☆26Nov 1, 2019Updated 6 years ago
- Awesome Knowledge Distillation☆3,826Updated this week
- Latte is a convolutional neural network (CNN) inference engine written in C++ and uses AVX to vectorize operations. The engine runs on Wi…☆13Jun 25, 2018Updated 7 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- [ICDCS 2023] Evaluation and Optimization of Gradient Compression for Distributed Deep Learning☆10Apr 28, 2023Updated 2 years ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆22Nov 15, 2020Updated 5 years ago
- Knowledge Distillation: CVPR2020 Oral, Revisiting Knowledge Distillation via Label Smoothing Regularization☆584Feb 15, 2023Updated 3 years ago
- our code☆33Jun 11, 2021Updated 4 years ago
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23☆23Apr 25, 2023Updated 2 years ago
- All about acceleration and compression of Deep Neural Networks☆33Nov 5, 2019Updated 6 years ago
- Benchmark framework of compute-in-memory based accelerators for deep neural network (inference engine focused)☆10Jun 1, 2021Updated 4 years ago
- Topology Distillation for Recommender System (KDD'21)☆13Sep 2, 2021Updated 4 years ago
- ☆13Apr 10, 2017Updated 8 years ago
- The code for Channel-Level Variable Quantization Network for Deep Image Compression (IJCAI 2020)☆30Jul 23, 2021Updated 4 years ago
- Knowledge Distillation Algorithms implemented with PyTorch☆17Jul 23, 2019Updated 6 years ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,270May 6, 2025Updated 10 months ago
- PyTorch library to facilitate development and standardized evaluation of neural network pruning methods.☆435Jul 7, 2023Updated 2 years ago
- Codes for our paper "Exploring Bit-Slice Sparsity in Deep Neural Networks for Efficient ReRAM-Based Deployment" [NeurIPS'19 EMC2 workshop]…☆10Oct 12, 2020Updated 5 years ago