针对pytorch模型的自动化模型结构分析和修改工具集,包含自动分析模型结构的模型压缩算法库
☆258Apr 19, 2023Updated 3 years ago
Alternatives and similar repositories for torch-model-compression
Users that are interested in torch-model-compression are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- PrefixKV: Adaptive Prefix KV Cache is What Vision Instruction-Following Models Need for Efficient Generation [NeurIPS 2025]☆18Oct 11, 2025Updated 7 months ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,271May 6, 2025Updated last year
- 高效部署:YOLO X, V3, V4, V5, V6, V7, V8, EdgeYOLO TRT推理 ™️ ,前后处理均由CUDA核函数实现 CPP/CUDA🚀☆54Feb 23, 2023Updated 3 years ago
- An onnx-based quantitation tool.☆71Jan 8, 2024Updated 2 years ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆878Mar 3, 2026Updated 2 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- yolov3 yolov4 channel and layer pruning, Knowledge Distillation 层剪枝,通道剪枝,知识蒸馏☆1,514Sep 3, 2020Updated 5 years ago
- pytorch AutoSlim tools,支持三行代码对pytorch模型进行剪枝压缩☆39Apr 23, 2021Updated 5 years ago
- cpp rotation album,基于cpp eigen实现的3d旋转相册,GAMES101复现内容☆12Jul 25, 2022Updated 3 years ago
- On-the-fly Structured Pruning for PyTorch models. This library implements several attributions metrics and structured pruning utils for n…☆167Jun 4, 2020Updated 5 years ago
- Various test models in WNNX format. It can view with `pip install wnetron && wnetron`☆12Jun 22, 2022Updated 3 years ago
- mobilev2-yolov5s剪枝、蒸馏,支持ncnn,tensorRT部署。ultra-light but better performence!☆859Sep 26, 2022Updated 3 years ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,310Sep 7, 2025Updated 8 months ago
- ☆79May 16, 2023Updated 3 years ago
- YOLO ModelCompression MultidatasetTraining☆443Jun 21, 2022Updated 3 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- OpenMMLab Model Compression Toolbox and Benchmark.☆1,671Jun 11, 2024Updated last year
- The c++ implement of Multi class ByteTrack☆14Mar 24, 2023Updated 3 years ago
- PyTorch Model Compression☆234Jan 27, 2023Updated 3 years ago
- ☆19May 28, 2020Updated 5 years ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,796Mar 28, 2024Updated 2 years ago
- PyTorch Quantization Aware Training(QAT,量化感知训练)☆44Oct 13, 2023Updated 2 years ago
- mobileNet SSD 基于caffe的前向检测☆10Nov 30, 2018Updated 7 years ago
- DEYOv1.5☆29Jul 22, 2024Updated last year
- 模型压缩demo(剪枝、量化、知识蒸馏)☆79Apr 20, 2020Updated 6 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- PaddleSlim is an open-source library for deep model compression and architecture search.☆1,614Jan 4, 2026Updated 4 months ago
- More readable and flexible yolov5 with more backbone(gcn, resnet, shufflenet, moblienet, efficientnet, hrnet, swin-transformer, etc) and …☆692Aug 19, 2024Updated last year
- This repo is implemented based on detectron2 and centernet☆556Dec 31, 2020Updated 5 years ago
- TensorRT ONNX Plugin、Inference、Compile☆464Jul 26, 2021Updated 4 years ago
- yolov5 prune,Support V2, V3, V4 and V6 versions of yolov5☆582Jan 6, 2022Updated 4 years ago
- Bert TensorRT模型加速部署☆10Apr 1, 2022Updated 4 years ago
- The official implementation of our ECCV 2024 publication, PYRA (Parallel Yielding Re-Activation).☆22Dec 19, 2025Updated 5 months ago
- Deploy your model with TensorRT quickly.☆764Nov 21, 2023Updated 2 years ago
- quantize aware training package for NCNN on pytorch☆68Jul 27, 2021Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Experiments based on CenterNet (more backbones, TensorRT deployment and mask head)☆221Jun 16, 2020Updated 5 years ago
- RepVGG: Making VGG-style ConvNets Great Again☆3,474Feb 10, 2023Updated 3 years ago
- 本项目支持对剪枝后的yolov5模型进行知识蒸馏训练(This project supports knowledge distillation training for the pruned YOLOv5 model)☆107Jan 28, 2024Updated 2 years ago
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆408Nov 22, 2022Updated 3 years ago
- Support Yolov5(4.0)/Yolov5(5.0)/YoloR/YoloX/Yolov4/Yolov3/CenterNet/CenterFace/RetinaFace/Classify/Unet. use darknet/libtorch/pytorch/mxn…☆211Aug 2, 2021Updated 4 years ago
- yolo model qat and deploy with deepstream&tensorrt☆602Sep 25, 2024Updated last year
- pytorch implementation of Funnel Activation (FReLU)☆16Aug 16, 2020Updated 5 years ago