HuangCongQing / model-compression-optimizationLinks

model compression and optimization for deployment for Pytorch, including knowledge distillation, quantization and pruning.(知识蒸馏，量化，剪枝)

☆18

Alternatives and similar repositories for model-compression-optimization

Users that are interested in model-compression-optimization are comparing it to the libraries listed below

Sorting:

liangyn22 / MCUFormer
[NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory
☆70Updated last year
ismail31416 / LumiNet
The official implementation of LumiNet: The Bright Side of Perceptual Knowledge Distillation https://arxiv.org/abs/2310.03669
☆19Updated last year
ososos888 / prune-then-distill
☆48Updated 2 years ago
TommyZihao / MMDeploy_Tutorials
Jupyter notebook tutorials for MMDeploy
☆35Updated 2 years ago
xuke225 / EQ-Net
EQ-Net [ICCV 2023]
☆30Updated last year
tangchen2 / Model-Compression
Model Compression 1. Pruning(BN Pruning) 2. Knowledge Distillation (Hinton) 3. Quantization (MNN) 4. Deployment (MNN)
☆79Updated 4 years ago
Cydia2018 / YOLOv5-Light
provide some new architecture, channel pruning and quantization methods for yolov5
☆29Updated this week
SteveTsui / Q-DETR
☆35Updated last year
sesmfs / onnx_quant_tool
An onnx-based quantitation tool.
☆71Updated last year
thb1314 / maskrcnn-tensorrt
☆47Updated 2 years ago
lliai / EMQ-series
[ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization
☆26Updated last year
cqu20160901 / DETR_onnx_tensorRT_V2
DETR tensor去除推理过程无用辅助头+fp16部署再次加速+解决转tensorrt 输出全为0问题的新方法。
☆12Updated last year
zju-SWJ / RLD
Official implementation for "Knowledge Distillation with Refined Logits".
☆14Updated 11 months ago
HankYe / PAGCP
[T-PAMI'23] PAGCP for the compression of YOLOv5
☆118Updated 2 years ago
Beryex / RLPruner-CNN
RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration
☆21Updated last month
UoS-EEC / DynamicOFA
[CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms
☆29Updated 2 years ago
chenlamei / MobileVit_TensorRT
TensorRT 2022 亚军方案，tensorrt加速mobilevit模型
☆68Updated 3 years ago
THU-MIG / torch-model-compression
针对pytorch模型的自动化模型结构分析和修改工具集，包含自动分析模型结构的模型压缩算法库
☆249Updated 2 years ago
xu-peng-tao / SSD-Pruning-and-quantization
Pruning and quantization for SSD. Model compression.
☆30Updated 4 years ago
Susan19900316 / yolov5_tensorrt_int8
yolov5 tensorrt int8量化方法汇总
☆77Updated last year
TanayNarshana / DFPC-Pruning
[ICLR 2023] PyTorch code for DFPC: Data flow driven pruning of coupled channels without data.
☆13Updated last year
thb1314 / mmyolo_tensorrt
☆145Updated last year
yl-jiang / YOLOSeries
YOLO Series
☆14Updated last year
zejiangh / Filter-GaP
The official PyTorch implementation of CHEX: CHannel EXploration for CNN Model Compression (CVPR 2022). Paper is available at https://ope…
☆38Updated 3 years ago
pprp / PyTorch-CIFAR-Model-Hub
Implementation of Conv-based and Vit-based networks designed for CIFAR.
☆70Updated 2 years ago
tinyvision / PreNAS
The official implementation of paper PreNAS: Preferred One-Shot Learning Towards Efficient Neural Architecture Search
☆29Updated last year
hisrg / Onnx-python
This repository is Onnx tutorial summary for python implements , which comes from other web resource.
☆29Updated 2 years ago
bytedance / MRECG
☆35Updated 2 years ago
GATECH-EIC / DepthShrinker
[ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …
☆35Updated 3 years ago
hustvl / PD-Quant
[CVPR 2023] PD-Quant: Post-Training Quantization Based on Prediction Difference Metric
☆56Updated 2 years ago