daquexian / onnx-simplifierLinks
Simplify your onnx model
☆4,248Updated 3 months ago
Alternatives and similar repositories for onnx-simplifier
Users that are interested in onnx-simplifier are comparing it to the libraries listed below
Sorting:
- ONNX-TensorRT: TensorRT backend for ONNX☆3,175Updated last month
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,904Updated last week
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,586Updated last month
- An easy to use PyTorch to TensorRT converter☆4,837Updated last year
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,500Updated 3 months ago
- Simple samples for TensorRT programming☆1,649Updated last week
- ONNX Optimizer☆779Updated last month
- OpenMMLab Model Deployment Framework☆3,086Updated last year
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,774Updated last year
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,519Updated last week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,473Updated last week
- Examples for using ONNX Runtime for machine learning inferencing.☆1,557Updated 3 weeks ago
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,620Updated last month
- Implementation of popular deep learning networks with TensorRT network definition API☆7,602Updated this week
- 🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉☆4,320Updated last week
- Tensorflow Backend for ONNX☆1,327Updated last year
- Deploy your model with TensorRT quickly.☆764Updated 2 years ago
- ☆1,043Updated last year
- Tutorials for creating and using ONNX models☆3,635Updated last year
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc.☆3,219Updated 3 months ago
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆955Updated 8 months ago
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…☆894Updated this week
- C++ library based on tensorrt integration☆2,837Updated 2 years ago
- A primitive library for neural network☆1,369Updated last year
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆861Updated 3 months ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,268Updated 7 months ago
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,544Updated this week
- TensorRT MODNet, YOLOv4, YOLOv3, SSD, MTCNN, and GoogLeNet☆1,785Updated 3 months ago
- yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn.☆731Updated last month
- A parser, editor and profiler tool for ONNX models.☆469Updated last month