OpenPPL / ppqLinks

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

☆1,714

Alternatives and similar repositories for ppq

Users that are interested in ppq are comparing it to the libraries listed below

Sorting:

NVIDIA / trt-samples-for-hackathon-cn
Simple samples for TensorRT programming
☆1,627Updated 2 months ago
ModelTC / MQBench
Model Quantization Benchmark
☆826Updated 3 months ago
Jermmy / pytorch-quantization-demo
A simple network quantization demo using pytorch from scratch.
☆536Updated 2 years ago
LitLeo / TensorRT_Tutorial
☆1,030Updated last year
alibaba / TinyNeuralNetwork
TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.
☆838Updated 2 months ago
ZhangGe6 / onnx-modifier
A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.
☆1,536Updated 5 months ago
nndeploy / nndeploy
Workflow-based Multi-platform AI Deployment Tool
☆1,109Updated this week
HeKun-NVIDIA / TensorRT-Developer_Guide_in_Chinese
☆292Updated 3 years ago
OpenPPL / ppl.nn
A primitive library for neural network
☆1,345Updated 8 months ago
666DZY666 / micronet
micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…
☆2,255Updated 2 months ago
BBuf / tvm_mlir_learn
compiler learning resources collect.
☆2,457Updated 4 months ago
sophgo / tpu-mlir
Machine learning compiler based on MLIR for Sophgo TPU.
☆765Updated last week
open-mmlab / mmdeploy
OpenMMLab Model Deployment Framework
☆3,000Updated 10 months ago
ThanatosShinji / onnx-tool
A parser, editor and profiler tool for ONNX models.
☆446Updated last month
godweiyang / NN-CUDA-Example
Several simple examples for popular neural network toolkits calling custom CUDA operators.
☆1,488Updated 4 years ago
open-mmlab / mmrazor
OpenMMLab Model Compression Toolbox and Benchmark.
☆1,608Updated last year
shouxieai / tensorRT_Pro
C++ library based on tensorrt integration
☆2,791Updated 2 years ago
pnnx / pnnx
PyTorch Neural Network eXchange
☆605Updated this week
OpenPPL / ppl.cv
ppl.cv is a high-performance image processing library of openPPL supporting various platforms.
☆507Updated 9 months ago
BBuf / how-to-optim-algorithm-in-cuda
how to optimize some algorithm in cuda.
☆2,345Updated this week
THU-MIG / torch-model-compression
针对pytorch模型的自动化模型结构分析和修改工具集，包含自动分析模型结构的模型压缩算法库
☆250Updated 2 years ago
HuPengsheet / use-ncnn
NCNN的代码学习，各种小Demo。
☆115Updated last year
zerollzeng / tiny-tensorrt
Deploy your model with TensorRT quickly.
☆768Updated last year
daquexian / onnx-simplifier
Simplify your onnx model
☆4,121Updated 10 months ago
onnx / onnx-tensorrt
ONNX-TensorRT: TensorRT backend for ONNX
☆3,124Updated last week
Laicheng0830 / Pytorch_Model_Quantization
OpenPose uses Pytorch for static quantization, saving, and loading of models
☆88Updated 4 years ago
agrechnev / trt-cpp-min
TensorRT 7 C++ (almost) minimal examples
☆82Updated last year
Tongkaio / CUDA_Kernel_Samples
CUDA 算子手撕与面试指南
☆511Updated 6 months ago
Tony-Tan / CUDA_Freshman
☆2,503Updated last year
PaddlePaddle / PaddleSlim
PaddleSlim is an open-source library for deep model compression and architecture search.
☆1,600Updated 3 weeks ago