Model compression for ONNX
☆100Mar 1, 2026Updated 3 weeks ago
Alternatives and similar repositories for neural-compressor
Users that are interested in neural-compressor are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Mar 10, 2026Updated 2 weeks ago
- A simple Python tool to measure the performance of ONNX models.☆27Sep 15, 2024Updated last year
- This repository provides tutorial, which discusses running sample publisher and subscriber using multiple transports of point_cloud_trans…☆10Mar 17, 2026Updated last week
- ☆25Sep 19, 2025Updated 6 months ago
- [WIP] ONNX parts yard. The various operations described in Operator Schemas are converted in advance into OP stand-alone ONNX files.☆11Mar 30, 2025Updated 11 months ago
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- ONNX-compatible DeDoDe 🎶 Detect, Don't Describe - Describe, Don't Detect, for Local Feature Matching. Supports TensorRT 🚀☆82Aug 21, 2023Updated 2 years ago
- Common utilities for ONNX converters☆296Dec 16, 2025Updated 3 months ago
- Python scripts for performing monocular depth estimation using the SC_Depth model in ONNX☆33Nov 13, 2022Updated 3 years ago
- A Toolkit to Help Optimize Large Onnx Model☆164Oct 26, 2025Updated 5 months ago
- Implementation of YOLOv9 QAT optimized for deployment on TensorRT platforms.☆131Apr 24, 2025Updated 11 months ago
- SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, …☆2,598Mar 20, 2026Updated last week
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆432Updated this week
- ggml学习笔记,ggml是一个机器学习的推理框架☆18Mar 24, 2024Updated 2 years ago
- An onnx-based quantitation tool.☆71Jan 8, 2024Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Simple tool for partial optimization of ONNX. Further optimize some models that cannot be optimized with onnx-optimizer and onnxsim by se…☆19May 7, 2024Updated last year
- ROS2 image transport plugin using libav(ffmpeg) for generating foxglove CompressedVideo messages☆11Oct 20, 2025Updated 5 months ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆179Mar 10, 2026Updated 2 weeks ago
- ONNX Optimizer☆800Updated this week
- ☆10Jul 18, 2024Updated last year
- [WACV 2026] Official implementation of the paper: “CountingDINO: A Training-free Pipeline for Exemplar-based Class-Agnostic Counting”☆55Mar 8, 2026Updated 2 weeks ago
- A Toolkit to Help Optimize Onnx Model☆460Updated this week
- ☆23Jan 3, 2024Updated 2 years ago
- NVIDIA TensorRT-RTX is an SDK for high-performance AI inference on NVIDIA RTX GPUs. This repository contains Open-Source Software compone…☆86Mar 18, 2026Updated last week
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- ffmpeg+cuvid+tensorrt+multicamera☆12Dec 31, 2024Updated last year
- LightNet-TRT is a high-efficiency and real-time implementation of convolutional neural networks (CNNs) using Edge AI.☆75Oct 3, 2023Updated 2 years ago
- A Gradio component that can be used to annotate images with bounding boxes.☆66Jan 24, 2026Updated 2 months ago
- Converts CLIP models to ONNX☆10Jan 17, 2023Updated 3 years ago
- Python scripts performing Open Vocabulary Object Detection using the YOLO-World model in ONNX.☆62Apr 7, 2024Updated last year
- BoT-SORT + YOLOX implemented using only onnxruntime, Numpy and scipy, without cython_bbox and PyTorch. Fast human tracker. OSNet is not u…☆48Jan 24, 2024Updated 2 years ago
- Large Language Model Onnx Inference Framework☆35Nov 25, 2025Updated 4 months ago
- JAX bindings for the flash-attention3 kernels☆21Jan 2, 2026Updated 2 months ago
- ONNX-compatible LightGlue: Local Feature Matching at Light Speed☆28Jul 5, 2023Updated 2 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- ☆30May 1, 2022Updated 3 years ago
- (unofficial)(2025)RANSAC Revisited: An Improved Algorithm for Robust Subspace Recovery under Adversarial and Noisy Corruptions☆30Aug 12, 2025Updated 7 months ago
- Productionize machine learning predictions, with ONNX or without☆66Jan 11, 2024Updated 2 years ago
- Serving Inside Pytorch☆171Feb 3, 2026Updated last month
- nav2-keepout-zone-map-creator is a tool that allows you to create a Keepout Zone map from an Occupancy Grid Map and 3D point cloud.☆27Apr 24, 2024Updated last year
- Deep insight tensorrt, including but not limited to qat, ptq, plugin, triton_inference, cuda☆23Mar 8, 2026Updated 2 weeks ago
- A ROS2 wrapper to use the HuNavSim with the Gazebo Simulator☆28Jan 8, 2026Updated 2 months ago