daquexian / onnx-simplifierLinks
Simplify your onnx model
☆4,165Updated 2 weeks ago
Alternatives and similar repositories for onnx-simplifier
Users that are interested in onnx-simplifier are comparing it to the libraries listed below
Sorting:
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,553Updated 6 months ago
- ONNX-TensorRT: TensorRT backend for ONNX☆3,149Updated last month
- PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT☆2,848Updated last week
- An easy to use PyTorch to TensorRT converter☆4,805Updated last year
- Convert TensorFlow, Keras, Tensorflow.js and Tflite models to ONNX☆2,469Updated last month
- OpenMMLab Model Deployment Framework☆3,031Updated 11 months ago
- ONNX Optimizer☆752Updated last month
- AIMET is a library that provides advanced quantization and compression techniques for trained neural network models.☆2,443Updated this week
- Simple samples for TensorRT programming☆1,636Updated 3 months ago
- Examples for using ONNX Runtime for machine learning inferencing.☆1,467Updated 2 weeks ago
- PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.☆1,745Updated last year
- Implementation of popular deep learning networks with TensorRT network definition API☆7,514Updated 4 months ago
- Tensorflow Backend for ONNX☆1,323Updated last year
- NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source compone…☆12,125Updated this week
- CV-CUDA™ is an open-source, GPU accelerated library for cloud-scale image processing and computer vision.☆2,578Updated 3 months ago
- Deploy your model with TensorRT quickly.☆768Updated last year
- C++ library based on tensorrt integration☆2,808Updated 2 years ago
- 🛠A lite C++ AI toolkit: 100+ models with MNN, ORT and TRT, including Det, Seg, Stable-Diffusion, Face-Fusion, etc.🎉☆4,234Updated 3 weeks ago
- Tutorials for creating and using ONNX models☆3,592Updated last year
- Convert ONNX models to PyTorch.☆698Updated last year
- SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX R…☆2,491Updated this week
- A primitive library for neural network☆1,350Updated 9 months ago
- TinyNeuralNetwork is an efficient and easy-to-use deep learning model compression framework.☆843Updated 3 weeks ago
- Neural Network Compression Framework for enhanced OpenVINO™ inference☆1,075Updated last week
- TensorRT MODNet, YOLOv4, YOLOv3, SSD, MTCNN, and GoogLeNet☆1,778Updated last week
- Self-Created Tools to convert ONNX files (NCHW) to TensorFlow/TFLite/Keras format (NHWC). The purpose of this tool is to solve the massiv…☆850Updated last month
- PyTorch Neural Network eXchange☆619Updated last week
- micronet, a model compression and deploy lib. compression: 1、quantization: quantization-aware-training(QAT), High-Bit(>2b)(DoReFa/Quantiz…☆2,256Updated 4 months ago
- ☆1,036Updated last year
- Bolt is a deep learning library with high performance and heterogeneous flexibility.☆954Updated 5 months ago