BBuf / model-compressionLinks
model compression based on pytorch (1、quantization: 8/4/2bits(dorefa)、ternary/binary value(twn/bnn/xnor-net);2、 pruning: normal、regular and group convolutional channel pruning;3、 group convolution structure;4、batch-normalization folding for binary value of feature(A))
☆170Updated 5 years ago
Alternatives and similar repositories for model-compression
Users that are interested in model-compression are comparing it to the libraries listed below
Sorting:
- 针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库☆251Updated 2 years ago
- A tutorial about how to build a TensorRT Engine from a PyTorch Model with the help of ONNX☆247Updated 4 years ago
- Pytorch-->onnx-->TensorRT; CUDA11, CUDNN8, TensorRT8☆212Updated last year
- Everything in Torch Fx☆345Updated last year
- ONNX2Pytorch☆162Updated 4 years ago
- pytorch AutoSlim tools,支持三行代码对pytorch模型进行剪枝压缩☆39Updated 4 years ago
- www.giantpandacv.com☆152Updated last year
- ResRep: Lossless CNN Pruning via Decoupling Remembering and Forgetting (ICCV 2021)☆297Updated 2 years ago
- A nnie quantization aware training tool on pytorch.☆239Updated 4 years ago
- Try out different pruning-approaches on lightweight Backbones.☆146Updated 5 years ago
- RM Operation can equivalently convert ResNet to VGG, which is better for pruning; and can help RepVGG perform better when the depth is la…☆211Updated 2 years ago
- A simple network quantization demo using pytorch from scratch.☆534Updated 2 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆402Updated 2 years ago
- An Improved One millisecond Mobile Backbone☆146Updated 3 years ago
- Learning Efficient Convolutional Networks through Network Slimming, In ICCV 2017.☆575Updated 6 years ago
- ☆337Updated 3 years ago
- ☆99Updated 4 years ago
- Personal Pytorch toy script.☆67Updated 3 years ago
- pytorch -> onnx -> caffe, pytorch to caffe, or other deep learning framework to onnx and onnx to caffe.☆163Updated 4 years ago
- Model Compression 1. Pruning(BN Pruning) 2. Knowledge Distillation (Hinton) 3. Quantization (MNN) 4. Deployment (MNN)☆79Updated 4 years ago
- An Object Detection Knowledge Distillation framework powered by pytorch, now having SSD and yolov5.☆225Updated 3 years ago
- ☆81Updated 4 years ago
- ghostnet_cifar10☆115Updated 4 years ago
- TensorRT-7 Network Lib 包括常用目标检测、关键点检测、人脸检测、OCR等 可训练自己数据☆532Updated 4 years ago
- RepVGG TensorRT int8 量化,实测推理不到1ms一帧!☆62Updated 4 years ago
- ☆125Updated 4 years ago
- NVIDIA TensorRT 加速推断教程!☆133Updated 4 years ago
- YOLO ModelCompression MultidatasetTraining☆443Updated 3 years ago
- 模型压缩demo(剪枝、量化、知识蒸馏)☆77Updated 5 years ago
- Code for some onnxruntime projects☆122Updated 4 years ago