PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
☆1,801Mar 28, 2024Updated 2 years ago
Alternatives and similar repositories for ppq
Users that are interested in ppq are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Model Quantization Benchmark☆869Apr 20, 2025Updated last year
- A primitive library for neural network☆1,371Nov 24, 2024Updated last year
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆514Oct 30, 2024Updated last year
- Simple samples for TensorRT programming☆1,659May 5, 2026Updated last month
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,626Nov 19, 2025Updated 6 months ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Simplify your onnx model☆4,346Jun 1, 2026Updated last week
- A simple tool that can generate TensorRT plugin code quickly.☆241Jul 11, 2023Updated 2 years ago
- OpenMMLab Model Deployment Framework☆3,126Sep 30, 2024Updated last year
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,269May 6, 2025Updated last year
- C++ library based on tensorrt integration☆2,881May 24, 2023Updated 3 years ago
- A list of papers, docs, codes about model quantization. This repo is aimed to provide the info for model quantization research, we are co…☆2,389May 11, 2026Updated 3 weeks ago
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,630Jun 1, 2026Updated last week
- A simple network quantization demo using pytorch from scratch.☆542Jun 18, 2023Updated 2 years ago
- MegCC是一个运行时超轻量,高效,移植简单的深度学习模型编译器☆482Oct 23, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework☆1,824Apr 25, 2026Updated last month
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆407Nov 22, 2022Updated 3 years ago
- compiler learning resources collect.☆2,744May 20, 2026Updated 2 weeks ago
- A model compression and acceleration toolbox based on pytorch.☆331Jan 12, 2024Updated 2 years ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆13,040Updated this week
- ☆140Apr 23, 2024Updated 2 years ago
- Offline Quantization Tools for Deploy.☆143Dec 28, 2023Updated 2 years ago
- Implementation of popular deep learning networks with TensorRT network definition API☆7,791May 20, 2026Updated 2 weeks ago
- how to optimize some algorithm in cuda.☆3,059May 25, 2026Updated 2 weeks ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Pytorch implementation of BRECQ, ICLR 2021☆300Aug 1, 2021Updated 4 years ago
- ☆150Jan 9, 2025Updated last year
- row-major matmul optimization☆727May 14, 2026Updated 3 weeks ago
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,311Sep 7, 2025Updated 9 months ago
- OpenMMLab Model Compression Toolbox and Benchmark.☆1,672Jun 11, 2024Updated last year
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,690May 28, 2026Updated last week
- ☆157Jan 20, 2024Updated 2 years ago
- Samples code for world class Artificial Intelligence SoCs for computer vision applications.☆295May 26, 2026Updated 2 weeks ago
- [IJCAI 2022] FQ-ViT: Post-Training Quantization for Fully Quantized Vision Transformer☆361Apr 11, 2023Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Transformer related optimization, including BERT, GPT☆6,419Mar 27, 2024Updated 2 years ago
- RepVGG: Making VGG-style ConvNets Great Again☆3,474Feb 10, 2023Updated 3 years ago
- [ICML 2023] SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models☆1,658Jul 12, 2024Updated last year
- A unified library of SOTA model optimization techniques like quantization, distillation, pruning, neural architecture search, speculative…☆2,891Updated this week
- ncnn is a high-performance neural network inference framework optimized for the mobile platform☆23,320May 30, 2026Updated last week
- Tengine is a lite, high performance, modular inference engine for embedded device☆4,525Mar 6, 2025Updated last year
- 🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉☆4,411Mar 19, 2026Updated 2 months ago