papers about model compression
☆166Feb 10, 2023Updated 3 years ago
Alternatives and similar repositories for awesome-model-compression
Users that are interested in awesome-model-compression are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Network acceleration methods☆178Jun 19, 2021Updated 4 years ago
- Awesome machine learning model compression research papers, quantization, tools, and learning material.☆544Sep 21, 2024Updated last year
- Papers for deep neural network compression and acceleration☆401Jun 21, 2021Updated 4 years ago
- ☆669Aug 25, 2021Updated 4 years ago
- Repository to track the progress in model compression and acceleration☆107Jun 19, 2021Updated 4 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- A curated list of neural network pruning resources.☆2,493Apr 4, 2024Updated 2 years ago
- Collection of recent methods on (deep) neural network compression and acceleration.☆954Apr 4, 2025Updated last year
- A list of high-quality (newest) AutoML works and lightweight models including 1.) Neural Architecture Search, 2.) Lightweight Structures,…☆856Jun 19, 2021Updated 4 years ago
- Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)☆1,512Jun 7, 2020Updated 5 years ago
- Implementations of the XNOR networks☆12Aug 9, 2017Updated 8 years ago
- PyTorch Model Compression☆234Jan 27, 2023Updated 3 years ago
- Summary, Code for Deep Neural Network Quantization☆562Updated this week
- Class Hierarchy with CTags for Sublime Text 2☆20Apr 29, 2014Updated 12 years ago
- a list of awesome papers on deep model ompression and acceleration☆350Jun 19, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Channel Pruning for Accelerating Very Deep Neural Networks (ICCV'17)☆1,089May 2, 2024Updated 2 years ago
- ☆26Apr 12, 2022Updated 4 years ago
- Aiming at an AI Chip based on RISC-V and NVDLA.☆21Mar 8, 2018Updated 8 years ago
- LQ-Nets: Learned Quantization for Highly Accurate and Compact Deep Neural Networks☆245Aug 30, 2022Updated 3 years ago
- implement of DoReFaNet with tensorflow based on cifar10 dataset☆28Nov 8, 2017Updated 8 years ago
- Reproduction of WAGE in PyTorch.☆44Nov 18, 2018Updated 7 years ago
- Single-Path NAS: Designing Hardware-Efficient ConvNets in less than 4 Hours☆394Dec 14, 2020Updated 5 years ago
- Awesome Knowledge-Distillation. 分类整理的知识蒸馏paper(2014-2021)。☆2,663May 30, 2023Updated 2 years ago
- This repo is about NAS☆26Nov 1, 2019Updated 6 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Awesome Knowledge Distillation☆3,866Mar 22, 2026Updated 2 months ago
- Latte is a convolutional neural network (CNN) inference engine written in C++ and uses AVX to vectorize operations. The engine runs on Wi…☆13Jun 25, 2018Updated 7 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- Revisiting Parameter Sharing for Automatic Neural Channel Number Search, NeurIPS 2020☆21Nov 15, 2020Updated 5 years ago
- Knowledge Distillation: CVPR2020 Oral, Revisiting Knowledge Distillation via Label Smoothing Regularization☆583Feb 15, 2023Updated 3 years ago
- A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and o…☆280Feb 23, 2023Updated 3 years ago
- our code☆33Jun 11, 2021Updated 4 years ago
- Implementation of PGONAS for CVPR22W and RD-NAS for ICASSP23☆23Apr 25, 2023Updated 3 years ago
- Benchmark framework of compute-in-memory based accelerators for deep neural network (inference engine focused)☆11Jun 1, 2021Updated 4 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- All about acceleration and compression of Deep Neural Networks☆33Nov 5, 2019Updated 6 years ago
- Topology Distillation for Recommender System (KDD'21)☆13Sep 2, 2021Updated 4 years ago
- ☆13Apr 10, 2017Updated 9 years ago
- The code for Channel-Level Variable Quantization Network for Deep Image Compression (IJCAI 2020)☆30Jul 23, 2021Updated 4 years ago
- Knowledge Distillation Algorithms implemented with PyTorch☆17Jul 23, 2019Updated 6 years ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,271May 6, 2025Updated last year
- PyTorch library to facilitate development and standardized evaluation of neural network pruning methods.☆435Jul 7, 2023Updated 2 years ago