A Toolkit to Help Optimize Onnx Model
☆460Mar 21, 2026Updated this week
Alternatives and similar repositories for OnnxSlim
Users that are interested in OnnxSlim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 4 months ago
- A Toolkit to Help Optimize Large Onnx Model☆164Oct 26, 2025Updated 5 months ago
- ☆12Feb 5, 2024Updated 2 years ago
- caffe model to onnx☆33Nov 16, 2022Updated 3 years ago
- katago benchmark☆14Mar 2, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- ☆18Jan 12, 2022Updated 4 years ago
- llm deploy project based onnx.☆50Oct 9, 2024Updated last year
- Machine Learning, Facial Rigger☆32Jan 20, 2022Updated 4 years ago
- caffe to tensorrt☆17Jan 24, 2019Updated 7 years ago
- Cuda Version Image Processing API☆40Mar 17, 2019Updated 7 years ago
- mnn asr demo.☆26Mar 24, 2025Updated last year
- llm-export can export llm model to onnx.☆346Oct 24, 2025Updated 5 months ago
- Everything in Torch Fx☆344Jun 7, 2024Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Model compression for ONNX☆100Mar 1, 2026Updated 3 weeks ago
- ONNX Optimizer☆800Updated this week
- Simplify your onnx model☆4,314Updated this week
- mnn tts demo.☆19May 7, 2025Updated 10 months ago
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,619Nov 19, 2025Updated 4 months ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆14Oct 24, 2023Updated 2 years ago
- Export the STFT or ISTFT process in ONNX format.☆41Mar 16, 2026Updated last week
- NVIDIA TensorRT-RTX is an SDK for high-performance AI inference on NVIDIA RTX GPUs. This repository contains Open-Source Software compone…☆86Mar 18, 2026Updated last week
- 🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime☆128Mar 12, 2026Updated 2 weeks ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆131Apr 24, 2025Updated 11 months ago
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆432Updated this week
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆11Aug 11, 2025Updated 7 months ago
- A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresse…☆2,218Updated this week
- ☆23Jan 3, 2024Updated 2 years ago
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated 11 months ago
- 一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework☆1,767Mar 15, 2026Updated last week
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 用于学习GOT/Qwen/OnnxLLm☆55Oct 8, 2024Updated last year
- 使用mnn-llm对GOT-OCR2.0进行推理☆14Oct 2, 2024Updated last year
- Detect CPU features with single-file☆452Mar 5, 2026Updated 3 weeks ago
- Ultralytics LLM-related experiments☆86Mar 20, 2026Updated last week
- A Flutter plugin to use ncnn, a high-performance neural network inference framework optimized for the mobile platform.☆21Nov 30, 2023Updated 2 years ago
- ☆124Dec 15, 2023Updated 2 years ago
- ☆26Mar 18, 2026Updated last week