A Toolkit to Help Optimize Onnx Model
☆446Mar 4, 2026Updated this week
Alternatives and similar repositories for OnnxSlim
Users that are interested in OnnxSlim are comparing it to the libraries listed below
Sorting:
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 3 months ago
- A Toolkit to Help Optimize Large Onnx Model☆165Oct 26, 2025Updated 4 months ago
- llm deploy project based onnx.☆50Oct 9, 2024Updated last year
- ☆11Sep 30, 2019Updated 6 years ago
- caffe model to onnx☆33Nov 16, 2022Updated 3 years ago
- ☆12Feb 5, 2024Updated 2 years ago
- llm-export can export llm model to onnx.☆344Oct 24, 2025Updated 4 months ago
- Model compression for ONNX☆100Updated this week
- ONNX Optimizer☆798Updated this week
- Simplify your onnx model☆4,304Feb 26, 2026Updated last week
- katago benchmark☆14Mar 2, 2022Updated 4 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- mnn asr demo.☆25Mar 24, 2025Updated 11 months ago
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,614Nov 19, 2025Updated 3 months ago
- CTC decoder with hotwords for ASR.☆34Apr 13, 2025Updated 10 months ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Oct 24, 2023Updated 2 years ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆129Apr 24, 2025Updated 10 months ago
- Everything in Torch Fx☆345Jun 7, 2024Updated last year
- mnn tts demo.☆19May 7, 2025Updated 10 months ago
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆429Updated this week
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- Export the STFT or ISTFT process in ONNX format.☆40Nov 21, 2025Updated 3 months ago
- A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresse…☆2,078Updated this week
- A Flutter plugin to use ncnn, a high-performance neural network inference framework optimized for the mobile platform.☆21Nov 30, 2023Updated 2 years ago
- 一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework☆1,751Feb 23, 2026Updated last week
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆12Aug 11, 2025Updated 6 months ago
- ☆23Jan 3, 2024Updated 2 years ago
- 用于学习GOT/Qwen/OnnxLLm☆53Oct 8, 2024Updated last year
- PyTorch Neural Network eXchange☆683Feb 27, 2026Updated last week
- ☆15Mar 31, 2025Updated 11 months ago
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago
- caffe to tensorrt☆17Jan 24, 2019Updated 7 years ago
- Cuda Version Image Processing API☆40Mar 17, 2019Updated 6 years ago
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆27Apr 23, 2024Updated last year
- A tool for parsing, editing, optimizing, and profiling ONNX models.☆480Feb 10, 2026Updated 3 weeks ago
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆447Feb 25, 2026Updated last week
- Machine Learning, Facial Rigger☆32Jan 20, 2022Updated 4 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- SuperPoint and LightGlue with TensorRT. Deploy with C++.☆22Dec 14, 2023Updated 2 years ago