daquexian / onnx-simplifierLinks
Simplify your onnx model
☆4,232Updated 3 months ago
Alternatives and similar repositories for onnx-simplifier
Users that are interested in onnx-simplifier are comparing it to the libraries listed below
Sorting:
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,574Updated last week
- ONNX-TensorRT: TensorRT backend for ONNX☆3,172Updated 3 weeks ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,894Updated this week
- An easy to use PyTorch to TensorRT converter☆4,833Updated last year
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,495Updated 2 months ago
- ONNX Optimizer☆774Updated 3 weeks ago
- OpenMMLab Model Deployment Framework☆3,074Updated last year
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,504Updated this week
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,771Updated last year
- Simple samples for TensorRT programming☆1,647Updated 6 months ago
- Examples for using ONNX Runtime for machine learning inferencing.☆1,539Updated last week
- Deploy your model with TensorRT quickly.☆765Updated 2 years ago
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,606Updated 2 weeks ago
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,403Updated 2 weeks ago
- 🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉☆4,294Updated 3 months ago
- Implementation of popular deep learning networks with TensorRT network definition API☆7,576Updated this week
- Tensorflow Backend for ONNX☆1,326Updated last year
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,533Updated this week
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆858Updated 3 months ago
- ☆1,041Updated last year
- A parser, editor and profiler tool for ONNX models.☆468Updated 3 weeks ago
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆1,109Updated this week
- PyTorch Neural Network eXchange☆649Updated last week
- A primitive library for neural network☆1,368Updated last year
- Tutorials for creating and using ONNX models☆3,628Updated last year
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆954Updated 7 months ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,267Updated 6 months ago
- yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn.☆731Updated 3 weeks ago
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…☆883Updated this week
- C++ library based on tensorrt integration☆2,829Updated 2 years ago