Use safetensors with ONNX 🤗
☆89Mar 3, 2026Updated 2 weeks ago
Alternatives and similar repositories for onnx-safetensors
Users that are interested in onnx-safetensors are comparing it to the libraries listed below
Sorting:
- Visualize ONNX models with model-explorer☆69Feb 13, 2026Updated last month
- Some benchmarks☆12Sep 19, 2019Updated 6 years ago
- ONNX format parsing and manipulation in C#.☆33Jan 16, 2025Updated last year
- ☆35Jun 29, 2020Updated 5 years ago
- Model compression for ONNX☆99Mar 1, 2026Updated 2 weeks ago
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX☆179Mar 10, 2026Updated last week
- ☆42Nov 29, 2022Updated 3 years ago
- Shrinks ONNX files by quantizing large float constants into eight bit equivalents.☆27Dec 5, 2025Updated 3 months ago
- Common utilities for ONNX converters☆295Dec 16, 2025Updated 3 months ago
- Profile your CoreML models directly from Python 🐍☆30Sep 8, 2025Updated 6 months ago
- the python api for axengine runtime☆26Mar 10, 2026Updated last week
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models.☆90Mar 13, 2026Updated last week
- No-code CLI designed for accelerating ONNX workflows☆228Feb 19, 2026Updated last month
- The Triton backend for the ONNX Runtime.☆173Mar 10, 2026Updated last week
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆432Updated this week
- Tokamax: A GPU and TPU kernel library.☆181Mar 13, 2026Updated last week
- ☆12Feb 5, 2024Updated 2 years ago
- ☆30Jan 24, 2026Updated last month
- 🤗 Optimum ExecuTorch☆120Mar 5, 2026Updated 2 weeks ago
- Tutorial on how to convert machine learned models into ONNX☆14Mar 11, 2023Updated 3 years ago
- The docs repository of Pulsar2 which is AXera's SoC 2rd AI toolchain. Such as AX650A, AX650N☆17Feb 12, 2026Updated last month
- Repository for ONNX SIG artifacts☆26Feb 14, 2026Updated last month
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily.☆186Mar 4, 2026Updated 2 weeks ago
- A Web App for the game of Go/Baduk/Weiqi. Based on Plotly Dash and GoTextProtocol engines.☆12Apr 10, 2025Updated 11 months ago
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization"☆11Mar 31, 2024Updated last year
- Repository for CPU Kernel Generation for LLM Inference☆28Jul 13, 2023Updated 2 years ago
- A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.☆32Sep 19, 2025Updated 6 months ago
- mnn tts demo.☆19May 7, 2025Updated 10 months ago
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆450Updated this week
- Awesome Quantization Paper lists with Codes☆10Feb 24, 2021Updated 5 years ago
- A tinder-like app for developers☆12Aug 3, 2023Updated 2 years ago
- Convert KataGo network files to ONNX format.☆14Nov 7, 2020Updated 5 years ago
- ONNX Optimizer☆800Mar 2, 2026Updated 2 weeks ago
- 8-bit floating point types for Rust☆63Feb 4, 2026Updated last month
- ☆16Apr 30, 2025Updated 10 months ago
- ☆16Apr 23, 2024Updated last year
- A nim module to handle polynomials☆13Jun 7, 2022Updated 3 years ago
- example of using CoreML from c++☆24Jun 14, 2023Updated 2 years ago
- A template code for running modular and reproducible experiments in pytorch☆13Sep 3, 2025Updated 6 months ago