OpenPPL / ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
☆1,654Updated 11 months ago
Alternatives and similar repositories for ppq:
Users that are interested in ppq are comparing it to the libraries listed below
- Simple samples for TensorRT programming☆1,587Updated last week
- Model Quantization Benchmark☆793Updated 2 months ago
- ☆1,019Updated last year
- A primitive library for neural network☆1,324Updated 3 months ago
- nndeploy is an end-to-end model inference and deployment framework. It aims to provide users with a powerful, easy-to-use, high-performan…☆713Updated this week
- ☆266Updated 2 years ago
- A simple network quantization demo using pytorch from scratch.☆521Updated last year
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,230Updated 3 years ago
- PyTorch Neural Network eXchange☆563Updated this week
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,446Updated 3 weeks ago
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆500Updated 4 months ago
- OpenMMLab Model Compression Toolbox and Benchmark.☆1,559Updated 9 months ago
- Machine learning compiler based on MLIR for Sophgo TPU.☆694Updated last week
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆815Updated last week
- how to optimize some algorithm in cuda.☆2,022Updated this week
- OpenMMLab Model Deployment Framework☆2,889Updated 5 months ago
- compiler learning resources collect.☆2,317Updated 9 months ago
- C++ library based on tensorrt integration☆2,706Updated last year
- Simplify your onnx model☆4,002Updated 6 months ago
- A parser, editor and profiler tool for ONNX models.☆421Updated 2 months ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆940Updated 7 months ago
- Deploy your model with TensorRT quickly.☆765Updated last year
- Several simple examples for popular neural network toolkits calling custom CUDA operators.☆1,417Updated 3 years ago
- Pytorch-->onnx-->TensorRT; CUDA11, CUDNN8, TensorRT8☆207Updated last year
- Everything in Torch Fx☆342Updated 9 months ago
- ONNX-TensorRT: TensorRT backend for ONNX☆3,044Updated 2 weeks ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆480Updated 4 months ago
- yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn.☆729Updated last month
- row-major matmul optimization☆611Updated last year
- A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co…☆2,026Updated 2 weeks ago