justinchuby / onnx-safetensorsLinks
Use safetensors with ONNX 🤗
☆69Updated 2 weeks ago
Alternatives and similar repositories for onnx-safetensors
Users that are interested in onnx-safetensors are comparing it to the libraries listed below
Sorting:
- Model compression for ONNX☆97Updated 10 months ago
- No-code CLI designed for accelerating ONNX workflows☆214Updated 3 months ago
- ONNX Script enables developers to naturally author ONNX functions and models using a subset of Python.☆381Updated last week
- Python bindings for ggml☆146Updated last year
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime☆411Updated last week
- Inference Vision Transformer (ViT) in plain C/C++ with ggml☆294Updated last year
- Thin wrapper around GGML to make life easier☆40Updated 2 months ago
- Visualize ONNX models with model-explorer☆39Updated 3 months ago
- An innovative library for efficient LLM inference via low-bit quantization☆348Updated last year
- 🤗 Optimum Intel: Accelerate inference with Intel optimization tools☆489Updated last week
- Common utilities for ONNX converters☆279Updated 2 weeks ago
- A safetensors extension to efficiently store sparse quantized tensors on disk☆161Updated this week
- A Toolkit to Help Optimize Onnx Model☆214Updated last week
- AMD related optimizations for transformer models☆88Updated 3 weeks ago
- GGUF parser in Python☆28Updated last year
- python package of rocm-smi-lib☆23Updated 2 months ago
- 🤗 Optimum ExecuTorch☆63Updated this week
- 🤗 Optimum ONNX: Export your model to ONNX and run inference with ONNX Runtime☆43Updated this week
- AI Edge Quantizer: flexible post training quantization for LiteRT models.☆65Updated this week
- Open deep learning compiler stack for cpu, gpu and specialized accelerators☆34Updated 2 years ago
- PyTorch half precision gemm lib w/ fused optional bias + optional relu/gelu☆73Updated 9 months ago
- TTS support with GGML☆176Updated 3 weeks ago
- A high-throughput and memory-efficient inference and serving engine for LLMs☆266Updated 11 months ago
- The Triton backend for the ONNX Runtime.☆161Updated last week
- Module, Model, and Tensor Serialization/Deserialization☆265Updated 3 weeks ago
- ☆68Updated 2 years ago
- ☆17Updated 9 months ago
- OpenVINO Tokenizers extension☆40Updated last week
- Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)☆194Updated this week
- Common source, scripts and utilities shared across all Triton repositories.☆76Updated last week