HuangCongQing / model-compression-optimizationLinks
model compression and optimization for deployment for Pytorch, including knowledge distillation, quantization and pruning.(知识蒸馏,量化,剪枝)
☆18Updated last year
Alternatives and similar repositories for model-compression-optimization
Users that are interested in model-compression-optimization are comparing it to the libraries listed below
Sorting:
- The official (TMLR) implementation of LumiNet: Perception-Driven Knowledge Distillation via Statistical Logit Calibration☆17Updated 2 months ago
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆73Updated 2 years ago
- A minimal Tensorflow2.0 implementation of Resnet on CIFAR10 dataset.☆13Updated 4 years ago
- ☆35Updated 2 years ago
- ☆48Updated 3 years ago
- Implementation of Conv-based and Vit-based networks designed for CIFAR.☆70Updated 2 years ago
- TensorRT 2022 亚军方案,tensorrt加速mobilevit模型☆68Updated 3 years ago
- An onnx-based quantitation tool.☆71Updated last year
- channel pruning for accelerating very deep neural networks☆13Updated 4 years ago
- [CVPRW 2021] Dynamic-OFA: Runtime DNN Architecture Switching for Performance Scaling on Heterogeneous Embedded Platforms☆30Updated 2 years ago
- Model Compression 1. Pruning(BN Pruning) 2. Knowledge Distillation (Hinton) 3. Quantization (MNN) 4. Deployment (MNN)☆79Updated 4 years ago
- 模型压缩demo(剪枝、量化、知识蒸馏)☆77Updated 5 years ago
- Jupyter notebook tutorials for MMDeploy☆36Updated 2 years ago
- EQ-Net [ICCV 2023]☆30Updated 2 years ago
- YOLO Series☆14Updated 2 years ago
- The official PyTorch implementation of CHEX: CHannel EXploration for CNN Model Compression (CVPR 2022). Paper is available at https://ope…☆38Updated 3 years ago
- provide some new architecture, channel pruning and quantization methods for yolov5☆29Updated 2 weeks ago
- 针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库☆254Updated 2 years ago
- Quantize pytorch model, support post-training quantization and quantization aware training methods☆14Updated 2 years ago
- [ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization☆28Updated last year
- Quantize,Pytorch,Vgg16,MobileNet☆43Updated 4 years ago
- YOLOv5 Quantization Aware Training (QAT, qat_torch branch) and Post Training Quantization with ONNX (ptq_onnx branch ptq_onnx.ipynb)☆16Updated 2 years ago
- Pytorch implementation of our paper accepted by IEEE TNNLS, 2022 — Carrying out CNN Channel Pruning in a White Box☆18Updated 3 years ago
- [ICML 2022] "DepthShrinker: A New Compression Paradigm Towards Boosting Real-Hardware Efficiency of Compact Neural Networks", by Yonggan …☆35Updated 3 years ago
- B站Efficient-Neural-Network学习分享的配套代码☆302Updated 3 years ago
- 用于 MobileNetV3 在自定义数据集上的量化,模型压缩90%而精度几乎不受影响,论文:HAQ: Hardware-Aware Automated Quantization with Mixed Precision☆19Updated 4 years ago
- 🔨🔨🔨(mmplot)used to draw graphs of multiple index parameters such as algorithm accuracy and speed of multiple deep learning models.☆85Updated last year
- Quantize yolov7 using pytorch_quantization.🚀🚀🚀☆10Updated 2 years ago
- Official implementation for "Knowledge Distillation with Refined Logits".☆18Updated last year
- DeiT implementation for Q-ViT☆25Updated 6 months ago