A Toolkit to Help Optimize Onnx Model
☆484May 18, 2026Updated last week
Alternatives and similar repositories for OnnxSlim
Users that are interested in OnnxSlim are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 6 months ago
- A Toolkit to Help Optimize Large Onnx Model☆164Oct 26, 2025Updated 7 months ago
- ☆11Sep 30, 2019Updated 6 years ago
- caffe model to onnx☆33Nov 16, 2022Updated 3 years ago
- katago benchmark☆14Mar 2, 2022Updated 4 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- ☆18Jan 12, 2022Updated 4 years ago
- llm deploy project based onnx.☆49Oct 9, 2024Updated last year
- Machine Learning, Facial Rigger☆33Jan 20, 2022Updated 4 years ago
- caffe to tensorrt☆17Jan 24, 2019Updated 7 years ago
- Cuda Version Image Processing API☆40Mar 17, 2019Updated 7 years ago
- mnn asr demo.☆27Mar 24, 2025Updated last year
- llm-export can export llm model to onnx.☆352May 8, 2026Updated 2 weeks ago
- Everything in Torch Fx☆343Jun 7, 2024Updated last year
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Model compression for ONNX☆101May 1, 2026Updated 3 weeks ago
- ONNX Optimizer☆814May 3, 2026Updated 3 weeks ago
- Simplify your onnx model☆4,342Apr 29, 2026Updated 3 weeks ago
- mnn tts demo.☆19May 7, 2025Updated last year
- A tool to modify ONNX models in a visualization fashion, based on Netron and Flask.☆1,628Nov 19, 2025Updated 6 months ago
- Quantize yolov5 using pytorch_quantization.🚀🚀🚀☆15Oct 24, 2023Updated 2 years ago
- NVIDIA TensorRT-RTX is an SDK for high-performance AI inference on NVIDIA RTX GPUs. This repository contains Open-Source Software compone…☆99Mar 18, 2026Updated 2 months ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆139Apr 24, 2025Updated last year
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆439Updated this week
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX. And Export the ONNX model for AXera's NPU☆12Aug 11, 2025Updated 9 months ago
- A unified library of SOTA model optimization techniques like quantization, pruning, distillation, speculative decoding, etc. It compresse…☆2,750Updated this week
- ☆23Jan 3, 2024Updated 2 years ago
- Demonstrate Yolov9 model with Qualcomm Hexagon NPU and DirectML☆12Nov 27, 2024Updated last year
- CTC decoder with hotwords for ASR.☆36Apr 13, 2025Updated last year
- 一款简单易用和高性能的AI部署框架 | An Easy-to-Use and High-Performance AI Deployment Framework☆1,821Apr 25, 2026Updated last month
- A simple neural network inference framework☆25Aug 1, 2023Updated 2 years ago
- 用于学习GOT/Qwen/OnnxLLm☆55Oct 8, 2024Updated last year
- 使用mnn-llm对GOT-OCR2.0进行推理☆14Oct 2, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Detect CPU features with single-file☆456Apr 6, 2026Updated last month
- A Flutter plugin to use ncnn, a high-performance neural network inference framework optimized for the mobile platform.☆21Nov 30, 2023Updated 2 years ago
- ☆125Dec 15, 2023Updated 2 years ago
- PyTorch Neural Network eXchange☆707May 19, 2026Updated last week
- Ultralytics LLM-related experiments☆95May 18, 2026Updated last week
- A tool for parsing, editing, optimizing, and profiling ONNX models.☆491May 18, 2026Updated last week
- ☆19Jan 19, 2024Updated 2 years ago