Use safetensors with ONNX 🤗
★89 · Jun 23, 2026 · Updated this week
Alternatives and similar repositories for onnx-safetensors
Users that are interested in onnx-safetensors are comparing it to the libraries listed below.
- Visualize ONNX models with model-explorer ★72 · May 19, 2026 · Updated last month
- Array APIs to write ONNX Graphs ★11 · Jan 18, 2026 · Updated 5 months ago
- Some benchmarks ★12 · Sep 19, 2019 · Updated 6 years ago
- CLI utility to inspect and explore .safetensors and .gguf files ★58 · Jun 20, 2026 · Updated last week
- The backend behind the LLM-Perf Leaderboard ★11 · May 5, 2024 · Updated 2 years ago
- Model compression for ONNX ★101 · May 1, 2026 · Updated last month
- .NET wrapper for XGBoost ★21 · Updated this week
- QONNX: Arbitrary-Precision Quantized Neural Networks in ONNX ★191 · Jun 10, 2026 · Updated 2 weeks ago
- [Japanese-language description, unreadable in source] ★14 · Aug 27, 2020 · Updated 5 years ago
- ★42 · Nov 29, 2022 · Updated 3 years ago
- Shrinks ONNX files by quantizing large float constants into eight-bit equivalents. ★29 · Dec 5, 2025 · Updated 6 months ago
- Common utilities for ONNX converters ★300 · Dec 16, 2025 · Updated 6 months ago
- Large Language Model ONNX Inference Framework ★35 · Nov 25, 2025 · Updated 7 months ago
- Profile your CoreML models directly from Python ★30 · Sep 8, 2025 · Updated 9 months ago
- The Python API for the axengine runtime ★26 · Mar 24, 2026 · Updated 3 months ago
- Experimental wasm32-unknown-wasi runtime for Python code execution ★40 · Nov 28, 2024 · Updated last year
- 🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime ★148 · Jun 15, 2026 · Updated 2 weeks ago
- Deep Speech Distances PyTorch ★29 · Feb 21, 2022 · Updated 4 years ago
- Hugging Face Inference Toolkit used to serve transformers, sentence-transformers, and diffusers models. ★96 · May 28, 2026 · Updated last month
- The Triton backend for the ONNX Runtime. ★180 · Updated this week
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python. ★443 · Jun 19, 2026 · Updated last week
- ★30 · May 10, 2026 · Updated last month
- ★12 · Feb 5, 2024 · Updated 2 years ago
- AMD SMI ★131 · May 28, 2026 · Updated last month
- The docs repository of Pulsar2, AXera's 2nd-generation AI toolchain for SoCs such as the AX650A and AX650N ★18 · Apr 23, 2026 · Updated 2 months ago
- A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ/VPTQ, and export to onnx/onnx-runtime easily. ★190 · Mar 23, 2026 · Updated 3 months ago
- A Web App for the game of Go/Baduk/Weiqi. Based on Plotly Dash and GoTextProtocol engines. ★12 · Apr 10, 2025 · Updated last year
- 🤗 Optimum ExecuTorch ★132 · May 26, 2026 · Updated last month
- Supporting code for "LLMs for your iPhone: Whole-Tensor 4 Bit Quantization" ★11 · Mar 31, 2024 · Updated 2 years ago
- Repository for CPU Kernel Generation for LLM Inference ★28 · Jul 13, 2023 · Updated 2 years ago
- MNN TTS demo. ★19 · May 7, 2025 · Updated last year
- Your one stop CLI for ONNX model analysis. ★48 · Nov 13, 2022 · Updated 3 years ago
- onnxruntime-extensions: A specialized pre- and post-processing library for ONNX Runtime ★469 · Jun 22, 2026 · Updated last week
- Awesome Quantization Paper lists with Codes ★10 · Feb 24, 2021 · Updated 5 years ago
- Convert KataGo network files to ONNX format. ★15 · Nov 7, 2020 · Updated 5 years ago
- ONNX Optimizer ★819 · Jun 12, 2026 · Updated 2 weeks ago
- ★18 · Apr 30, 2025 · Updated last year
- 8-bit floating point types for Rust ★64 · Feb 4, 2026 · Updated 4 months ago
- ★16 · Apr 23, 2024 · Updated 2 years ago