HuangCongQing / model-compression-optimizationLinks
model compression and optimization for deployment for Pytorch, including knowledge distillation, quantization and pruning.(知识蒸馏,量化,剪枝)
☆18Updated last year
Alternatives and similar repositories for model-compression-optimization
Users that are interested in model-compression-optimization are comparing it to the libraries listed below
Sorting:
- [NeurIPS 2023] MCUFormer: Deploying Vision Transformers on Microcontrollers with Limited Memory☆72Updated last year
- ☆35Updated 2 years ago
- EQ-Net [ICCV 2023]☆30Updated 2 years ago
- The official (TMLR) implementation of LumiNet: Perception-Driven Knowledge Distillation via Statistical Logit Calibration☆17Updated last month
- Model Compression 1. Pruning(BN Pruning) 2. Knowledge Distillation (Hinton) 3. Quantization (MNN) 4. Deployment (MNN)☆79Updated 4 years ago
- provide some new architecture, channel pruning and quantization methods for yolov5☆29Updated last month
- Quantize pytorch model, support post-training quantization and quantization aware training methods☆14Updated 2 years ago
- RL-Pruner: Structured Pruning Using Reinforcement Learning for CNN Compression and Acceleration☆25Updated 4 months ago
- ☆48Updated 3 years ago
- Implementation of Conv-based and Vit-based networks designed for CIFAR.☆70Updated 2 years ago
- [ICCV-2023] EMQ: Evolving Training-free Proxies for Automated Mixed Precision Quantization☆28Updated last year
- Quantize,Pytorch,Vgg16,MobileNet☆42Updated 4 years ago
- TensorRT 2022 亚军方案,tensorrt加速mobilevit模型☆68Updated 3 years ago
- 针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库☆253Updated 2 years ago
- [ICML 2023] This project is the official implementation of our accepted ICML 2023 paper BiBench: Benchmarking and Analyzing Network Binar…☆55Updated last year
- 模型压缩demo(剪枝、量化、知识蒸馏)☆77Updated 5 years ago
- Official implementation for ECCV 2022 paper LIMPQ - "Mixed-Precision Neural Network Quantization via Learned Layer-wise Importance"☆61Updated 2 years ago
- ☆36Updated 2 years ago
- YOLOv5 Quantization Aware Training (QAT, qat_torch branch) and Post Training Quantization with ONNX (ptq_onnx branch ptq_onnx.ipynb)☆16Updated 2 years ago
- An onnx-based quantitation tool.☆71Updated last year
- ☆15Updated 5 years ago
- [ICLR 2022] The Unreasonable Effectiveness of Random Pruning: Return of the Most Naive Baseline for Sparse Training by Shiwei Liu, Tianlo…☆77Updated 2 years ago
- [CVPR 2024] PTQ4SAM: Post-Training Quantization for Segment Anything☆79Updated last year
- [ICLR 2023] Pruning Deep Neural Networks from a Sparsity Perspective☆25Updated last year
- The official implementation of paper PreNAS: Preferred One-Shot Learning Towards Efficient Neural Architecture Search☆30Updated 2 years ago
- The official PyTorch implementation of CHEX: CHannel EXploration for CNN Model Compression (CVPR 2022). Paper is available at https://ope…☆38Updated 3 years ago
- ☆12Updated 2 years ago
- [T-PAMI'23] PAGCP for the compression of YOLOv5☆121Updated 2 years ago
- Code for ICCV23 paper "Automatic network pruning via Hilbert Schmidt independence criterion lasso under information bottleneck principle"☆18Updated 2 years ago
- ☆19Updated 3 years ago