A Toolkit to Help Optimize Onnx Model
☆476May 5, 2026Updated this week
Alternatives and similar repositories for OnnxSlim
Users that are interested in OnnxSlim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 5 months ago
- A Toolkit to Help Optimize Large Onnx Model☆165Oct 26, 2025Updated 6 months ago
- ☆11Sep 30, 2019Updated 6 years ago
- ☆12Feb 5, 2024Updated 2 years ago
- caffe model to onnx☆33Nov 16, 2022Updated 3 years ago
- GPUs on demand by Runpod - Special Offer Available • AdRun AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
- katago benchmark☆14Mar 2, 2022Updated 4 years ago
- ☆18Jan 12, 2022Updated 4 years ago
- llm deploy project based onnx.☆49Oct 9, 2024Updated last year
- Machine Learning, Facial Rigger☆33Jan 20, 2022Updated 4 years ago
- caffe to tensorrt☆17Jan 24, 2019Updated 7 years ago
- Cuda Version Image Processing API☆40Mar 17, 2019Updated 7 years ago
- mnn asr demo.☆26Mar 24, 2025Updated last year
- llm-export can export llm model to onnx.☆350Oct 24, 2025Updated 6 months ago
- Everything in Torch Fx☆343Jun 7, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- Model compression for ONNX☆101Updated this week
- ONNX Optimizer☆809Updated this week
- Simplify your onnx model☆4,331Apr 29, 2026Updated last week
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,625Nov 19, 2025Updated 5 months ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆15Oct 24, 2023Updated 2 years ago
- Export the STFT or ISTFT process in ONNX format.☆43Mar 16, 2026Updated last month
- 🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime☆145Apr 28, 2026Updated last week
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆140Apr 24, 2025Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆438Updated this week
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆12Aug 11, 2025Updated 8 months ago
- A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresse…☆2,591Updated this week
- ☆23Jan 3, 2024Updated 2 years ago
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- CTC decoder with hotwords for ASR.☆35Apr 13, 2025Updated last year
- 一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework☆1,801Apr 25, 2026Updated last week
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago
- 用于学习GOT/Qwen/OnnxLLm☆55Oct 8, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 使用mnn-llm对GOT-OCR2.0进行推理☆14Oct 2, 2024Updated last year
- ☆32Jan 28, 2025Updated last year
- Detect CPU features with single-file☆456Apr 6, 2026Updated last month
- A Flutter plugin to use ncnn, a high-performance neural network inference framework optimized for the mobile platform.☆21Nov 30, 2023Updated 2 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- PyTorch Neural Network eXchange☆705Apr 14, 2026Updated 3 weeks ago
- Ultralytics LLM-related experiments☆94Apr 24, 2026Updated last week