ONNX Serving is a C++ project that serves onnx-mlir compiled models over gRPC and other protocols. Thanks to its C++ implementation, ONNX Serving has very low latency overhead and high throughput. ONNX Serving provides dynamic batch aggregation and a worker pool to fully utilize the AI accelerators on the machine.
☆26 · Sep 17, 2025 · Updated 5 months ago
Alternatives and similar repositories for onnx-mlir-serving
Users who are interested in onnx-mlir-serving are comparing it to the libraries listed below.
- Scoreboard for ONNX Backend Compatibility ☆29 · Jan 24, 2026 · Updated last month
- edge/mobile transformer based Vision DNN inference benchmark ☆16 · Aug 29, 2025 · Updated 6 months ago
- The Gstreamer hardware encoder/decoder plugins for Rockchip platform ☆13 · Oct 8, 2023 · Updated 2 years ago
- Notes and artifacts from the ONNX steering committee ☆28 · Feb 26, 2026 · Updated last week
- Tutorial on how to convert machine learned models into ONNX ☆15 · Mar 11, 2023 · Updated 2 years ago
- Representation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure ☆977 · Feb 24, 2026 · Updated last week
- Remote source nodes for NNStreamer pipelines without GStreamer dependencies ☆17 · Jan 22, 2026 · Updated last month
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python. ☆429 · Updated this week
- Convert ANY IR to ONNX format ☆25 · Feb 12, 2026 · Updated 3 weeks ago
- Common utilities for ONNX converters ☆295 · Dec 16, 2025 · Updated 2 months ago
- Repository for ONNX SIG artifacts ☆26 · Feb 14, 2026 · Updated 2 weeks ago
- example of using CoreML from c++ ☆24 · Jun 14, 2023 · Updated 2 years ago
- PyTorch -> ONNX -> TVM for autotuning ☆24 · Feb 28, 2020 · Updated 6 years ago
- Build TensorFlow Lite runtime with GitHub Actions ☆27 · Jul 25, 2025 · Updated 7 months ago
- A web app to convert a PyMOL PSE file or PDB file to a easy to implement NGL.js view that can be implemented easily on any site ☆28 · Jan 31, 2023 · Updated 3 years ago
- A SapientML plugin of SapientMLGenerator ☆11 · Dec 23, 2025 · Updated 2 months ago
- Efficient in-memory representation for ONNX, in Python ☆42 · Feb 25, 2026 · Updated last week
- ☆18 · Jan 12, 2026 · Updated last month
- ☆12 · Nov 7, 2022 · Updated 3 years ago
- EdgeCortix maintained and extended fork of Apache TVM compiler stack utilized by MERA framework. TVM is an open deep learning compiler st… ☆11 · Dec 22, 2023 · Updated 2 years ago
- ☆20 · Updated this week
- TL2cgen (TreeLite 2 C GENerator) is a model compiler for decision tree models ☆46 · Feb 23, 2026 · Updated last week
- Tools for storing, search and analyze GC/MS spectra ☆11 · Dec 19, 2024 · Updated last year
- ☆10 · Jul 12, 2017 · Updated 8 years ago
- Benchmark scripts for TVM ☆74 · Mar 15, 2022 · Updated 3 years ago
- Visualize ONNX models with model-explorer ☆69 · Feb 13, 2026 · Updated 2 weeks ago
- Tools related to the Genomics of Drug Sensitivity in Cancer (GDSC) projects (http://www.cancerrxgene.org/) ☆36 · Jan 6, 2022 · Updated 4 years ago
- Tensorflow Lite external delegate based on TIM-VX ☆48 · Nov 25, 2025 · Updated 3 months ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ☆178 · Feb 19, 2026 · Updated 2 weeks ago
- Continuous quality evaluation of ML algorithms via CI/CD and GitHub Actions. ☆16 · Jan 15, 2020 · Updated 6 years ago
- Notes and samples for Python performance talk ☆10 · Feb 17, 2022 · Updated 4 years ago
- Representation of Module Activity ☆10 · May 31, 2017 · Updated 8 years ago
- Graph-based framework to manipulate and analyze cell lineages from cell tracking data ☆24 · Feb 20, 2026 · Updated last week
- Large Language Model Onnx Inference Framework ☆35 · Nov 25, 2025 · Updated 3 months ago
- AI-ML-NLP Task Group ☆13 · Aug 10, 2023 · Updated 2 years ago
- An R package to write Datalog queries and interact with a Datomic database ☆11 · Aug 12, 2021 · Updated 4 years ago
- RADIS (Radiology Report Archive and Discovery System) is an innovative open-source web application developed by our team to enhance the m… ☆12 · Updated this week
- Sample Level Analysis of Pathway Alteration Enrichments ☆10 · Jan 21, 2019 · Updated 7 years ago
- rewrite python scipy.signal.lfilter in c code ☆11 · Aug 13, 2019 · Updated 6 years ago