daquexian / onnx-simplifier
Simplify your onnx model
☆4,240 Updated 3 months ago
Alternatives and similar repositories for onnx-simplifier
Users who are interested in onnx-simplifier are comparing it to the libraries listed below.
- ONNX-TensorRT: TensorRT backend for ONNX ☆3,173 Updated last month
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask. ☆1,583 Updated 3 weeks ago
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT ☆2,895 Updated this week
- An easy to use PyTorch to TensorRT converter ☆4,836 Updated last year
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX ☆2,499 Updated 2 months ago
- ONNX Optimizer ☆781 Updated last month
- OpenMMLab Model Deployment Framework ☆3,080 Updated last year
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models. ☆2,513 Updated this week
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool. ☆1,772 Updated last year
- Simple samples for TensorRT programming ☆1,648 Updated 2 weeks ago
- Tensorflow Backend for ONNX ☆1,327 Updated last year
- Neural Network Compression Framework for enhanced OpenVINO™ inference ☆1,109 Updated this week
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision. ☆2,615 Updated 3 weeks ago
- Deploy your model with TensorRT quickly. ☆764 Updated 2 years ago
- Examples for using ONNX Runtime for machine learning inferencing. ☆1,552 Updated last week
- Implementation of popular deep learning networks with TensorRT network definition API ☆7,579 Updated last week
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone… ☆12,445 Updated this week
- 🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉 ☆4,312 Updated last week
- Bolt is a deep learning library with high performance and heterogeneous flexibility. ☆954 Updated 7 months ago
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv… ☆889 Updated 2 weeks ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework. ☆860 Updated 3 months ago
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, … ☆2,543 Updated this week
- Convert ONNX models to PyTorch. ☆710 Updated last month
- [CVPR 2023] DepGraph: Towards Any Structural Pruning; LLMs, Vision Foundation Models, etc. ☆3,203 Updated 3 months ago
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz… ☆2,269 Updated 7 months ago
- A parser, editor and profiler tool for ONNX models. ☆469 Updated last month
- ☆1,041 Updated last year
- TNN: developed by Tencent Youtu Lab and Guangying Lab, a uniform deep learning inference framework for mobile, desktop and server. TNN is … ☆4,595 Updated 7 months ago
- PyTorch Neural Network eXchange ☆656 Updated 2 weeks ago
- yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn. ☆731 Updated last month