Adlik / model_optimizer
☆38Updated 2 years ago
Alternatives and similar repositories for model_optimizer
Users who are interested in model_optimizer are comparing it to the libraries listed below.
- ☆27Updated last year
- Adlik: Toolkit for Accelerating Deep Learning Inference☆810Updated 2 years ago
- Model optimizer used in Adlik.☆42Updated 2 years ago
- Model Quantization Benchmark☆857Updated 9 months ago
- ppl.cv is a high-performance image processing library of openPPL supporting various platforms.☆515Updated last year
- A parser, editor and profiler tool for ONNX models.☆478Updated 3 months ago
- TensorRT Plugin Autogen Tool☆366Updated 2 years ago
- A primitive library for neural network☆1,368Updated last year
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,781Updated last year
- EasyQuant(EQ) is an efficient and simple post-training quantization method via effectively optimizing the scales of weights and activatio…☆408Updated 3 years ago
- Everything in Torch Fx☆345Updated last year
- NART = NART is not A RunTime, a deep learning inference framework.☆37Updated 2 years ago
- ☆33Updated 2 years ago
- Offline Quantization Tools for Deploy.☆142Updated 2 years ago
- A simple network quantization demo using pytorch from scratch.☆542Updated 2 years ago
- ⚡ Useful scripts when using TensorRT☆237Updated 5 years ago
- BladeDISC is an end-to-end DynamIc Shape Compiler project for machine learning workloads.☆916Updated last year
- Simple samples for TensorRT programming☆1,658Updated 2 weeks ago
- ☆1,047Updated last year
- benchmark for embedded-ai deep learning inference engines, such as NCNN / TNN / MNN / TensorFlow Lite etc.☆203Updated 4 years ago
- A sample for onnxparser working with trt user defined plugins for TRT7.0☆171Updated 5 years ago
- A flexible and efficient deep neural network (DNN) compiler that generates high-performance executable from a DNN model description.☆1,006Updated last year
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆956Updated 9 months ago
- [EMNLP 2024 & AAAI 2026] A powerful toolkit for compressing large models including LLMs, VLMs, and video generative models.☆672Updated 2 months ago
- A nnie quantization aware training tool on pytorch.☆238Updated 5 years ago
- Yinghan's Code Sample☆365Updated 3 years ago
- ONNX2Pytorch☆165Updated 4 years ago
- A DNN inference latency prediction toolkit for accurately modeling and predicting the latency on diverse edge devices.☆364Updated last year
- Quantization library for PyTorch. Support low-precision and mixed-precision quantization, with hardware implementation through TVM.☆453Updated 2 years ago
- ☆314Updated 3 years ago