Model compression for ONNX
☆100Mar 1, 2026Updated this week
Alternatives and similar repositories for neural-compressor
Users that are interested in neural-compressor are comparing it to the libraries listed below
Sorting:
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- ☆25Sep 19, 2025Updated 5 months ago
- Python scripts for performing monocular depth estimation using the SC_Depth model in ONNX☆32Nov 13, 2022Updated 3 years ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆129Apr 24, 2025Updated 10 months ago
- Simple tool for partial optimization of ONNX. Further optimize some models that cannot be optimized with onnx-optimizer and onnxsim by se…☆19May 7, 2024Updated last year
- A Toolkit to Help Optimize Large Onnx Model☆165Oct 26, 2025Updated 4 months ago
- This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…☆11Feb 24, 2026Updated last week
- ONNX-compatible DeDoDe 🎶 Detect, Don't Describe - Describe, Don't Detect, for Local Feature Matching. Supports TensorRT 🚀☆81Aug 21, 2023Updated 2 years ago
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated last year
- Merging YOLOv9 and DepthAnythingV2☆30Jun 28, 2025Updated 8 months ago
- Common utilities for ONNX converters☆295Dec 16, 2025Updated 2 months ago
- YOLOX-ti-lite models exportable to TFLite☆22Mar 27, 2023Updated 2 years ago
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,590Updated this week
- An onnx-based quantitation tool.☆71Jan 8, 2024Updated 2 years ago
- ONNX Optimizer☆797Feb 4, 2026Updated last month
- [WACV 2026] Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆48Nov 10, 2025Updated 3 months ago
- Convert ANY IR to ONNX format☆25Feb 12, 2026Updated 2 weeks ago
- A Toolkit to Help Optimize Onnx Model☆442Feb 26, 2026Updated last week
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆62Apr 7, 2024Updated last year
- A Gradio component that can be used to annotate images with bounding boxes.☆66Jan 24, 2026Updated last month
- NVIDIA TensorRT-RTX is an SDK for high-performance AI inference on NVIDIA RTX GPUs. This repository contains Open-Source Software compone…☆83Dec 19, 2025Updated 2 months ago
- BoT-SORT + YOLOX implemented using only onnxruntime, Numpy and scipy, without cython_bbox and PyTorch. Fast human tracker. OSNet is not u…☆46Jan 24, 2024Updated 2 years ago
- ☆23Jan 3, 2024Updated 2 years ago
- ONNX-compatible LightGlue: Local Feature Matching at Light Speed☆28Jul 5, 2023Updated 2 years ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 3 months ago
- Productionize machine learning predictions, with ONNX or without☆66Jan 11, 2024Updated 2 years ago
- ☆40May 22, 2023Updated 2 years ago
- A tool convert TensorRT engine/plan to a fake onnx☆41Nov 22, 2022Updated 3 years ago
- ROS2 image transport plugin using libav(ffmpeg) for generating foxglove CompressedVideo messages☆11Oct 20, 2025Updated 4 months ago
- 処理の検証や比較検討での用途を想定したノードエディターベースの画像処理アプリ☆11Mar 5, 2023Updated 3 years ago
- Inference deployment of the llama3☆11Apr 21, 2024Updated last year
- ☆10Jul 18, 2024Updated last year
- the original reference implementation of a specified llama.cpp backend for Qualcomm Hexagon NPU on Android phone, https://github.com/ggml…☆36Jul 14, 2025Updated 7 months ago
- RKNN模型推理部署模板☆24Aug 11, 2023Updated 2 years ago
- llm deploy project based onnx.☆50Oct 9, 2024Updated last year
- Converts CLIP models to ONNX☆11Jan 17, 2023Updated 3 years ago
- Python scripts performing object detection using the YOLOv8 model in ONNX.☆11Apr 18, 2023Updated 2 years ago
- Python scripts performing optical flow estimation using the NeuFlowV2 model in ONNX.☆52Sep 15, 2024Updated last year
- only contain face detect 、5/81 points 、face recognization models☆11Jul 9, 2020Updated 5 years ago