inisis / OnnxSlim
A Toolkit to Help Optimize Onnx Model
☆421 · Updated this week
Alternatives and similar repositories for OnnxSlim
Users who are interested in OnnxSlim are comparing it to the libraries listed below.
- Large Language Model Onnx Inference Framework ☆35 · Nov 25, 2025 · Updated 2 months ago
- A Toolkit to Help Optimize Large Onnx Model ☆164 · Oct 26, 2025 · Updated 3 months ago
- llm deploy project based onnx. ☆49 · Oct 9, 2024 · Updated last year
- ☆11 · Sep 30, 2019 · Updated 6 years ago
- caffe model to onnx ☆33 · Nov 16, 2022 · Updated 3 years ago
- llm-export can export llm model to onnx. ☆344 · Oct 24, 2025 · Updated 3 months ago
- Model compression for ONNX ☆99 · Nov 18, 2024 · Updated last year
- ONNX Optimizer ☆797 · Feb 4, 2026 · Updated last week
- Simplify your onnx model ☆4,294 · Jan 29, 2026 · Updated 2 weeks ago
- katago benchmark ☆14 · Mar 2, 2022 · Updated 3 years ago
- A tool convert TensorRT engine/plan to a fake onnx ☆41 · Nov 22, 2022 · Updated 3 years ago
- ☆18 · Jan 12, 2022 · Updated 4 years ago
- mnn asr demo. ☆25 · Mar 24, 2025 · Updated 10 months ago
- CTC decoder with hotwords for ASR. ☆34 · Apr 13, 2025 · Updated 10 months ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀 ☆14 · Oct 24, 2023 · Updated 2 years ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms. ☆129 · Apr 24, 2025 · Updated 9 months ago
- NVIDIA TensorRT-RTX is an SDK for high-performance AI inference on NVIDIA RTX GPUs. This repository contains Open-Source Software compone… ☆83 · Dec 19, 2025 · Updated last month
- Everything in Torch Fx ☆345 · Jun 7, 2024 · Updated last year
- mnn tts demo. ☆19 · May 7, 2025 · Updated 9 months ago
- 🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime ☆116 · Updated this week
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python. ☆422 · Updated this week
- Inference deployment of the llama3 ☆11 · Apr 21, 2024 · Updated last year
- Export the STFT or ISTFT process in ONNX format. ☆40 · Nov 21, 2025 · Updated 2 months ago
- A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresse… ☆1,964 · Updated this week
- A Flutter plugin to use ncnn, a high-performance neural network inference framework optimized for the mobile platform. ☆21 · Nov 30, 2023 · Updated 2 years ago
- An Easy-to-Use and High-Performance AI Deployment Framework ☆1,728 · Updated this week
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU ☆12 · Aug 11, 2025 · Updated 6 months ago
- For learning GOT/Qwen/OnnxLLm ☆53 · Oct 8, 2024 · Updated last year
- ☆23 · Jan 3, 2024 · Updated 2 years ago
- PyTorch Neural Network eXchange ☆674 · Jan 30, 2026 · Updated 2 weeks ago
- A simple neural network inference framework ☆25 · Aug 1, 2023 · Updated 2 years ago
- caffe to tensorrt ☆17 · Jan 24, 2019 · Updated 7 years ago
- Cuda Version Image Processing API ☆40 · Mar 17, 2019 · Updated 6 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ☆27 · Apr 23, 2024 · Updated last year
- A parser, editor and profiler tool for ONNX models. ☆480 · Nov 3, 2025 · Updated 3 months ago
- ☆125 · Dec 15, 2023 · Updated 2 years ago
- Machine Learning, Facial Rigger ☆32 · Jan 20, 2022 · Updated 4 years ago
- SuperPoint and LightGlue with TensorRT. Deploy with C++. ☆21 · Dec 14, 2023 · Updated 2 years ago
- Simple inference for Vits2 TTS Using ONNXRUNTIME and espeak-ng on C++ ☆18 · Apr 17, 2024 · Updated last year