moonshine-ai / onnx_shrink_ray
Shrinks ONNX files by quantizing large float constants into eight-bit equivalents.
☆24 · Updated 6 months ago
Alternatives and similar repositories for onnx_shrink_ray
Users who are interested in onnx_shrink_ray are comparing it to the libraries listed below.
- Use safetensors with ONNX · ☆69 · Updated last month
- Port of Meta's Encodec in C/C++ · ☆226 · Updated 8 months ago
- A C++ ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile… · ☆42 · Updated 11 months ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen · ☆47 · Updated 4 months ago
- Model compression for ONNX · ☆97 · Updated 8 months ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,… · ☆17 · Updated last year
- Lyra V2 (SoundStream) running in the browser · ☆19 · Updated last year
- ONNX implementation of Whisper. PyTorch free. · ☆101 · Updated 8 months ago
- ONNX and TensorRT implementation of Whisper · ☆64 · Updated 2 years ago
- Profile your CoreML models directly from Python · ☆28 · Updated 9 months ago
- Trying to make WebGPU a bit easier to use · ☆16 · Updated last year
- Tracking the state of the art and recent results (bibliography) on sound tasks. · ☆32 · Updated 2 years ago
- TTS support with GGML · ☆143 · Updated 2 weeks ago
- GGML implementation of BERT model with Python bindings and quantization. · ☆26 · Updated last year
- ONNX Script editor & visualiser running completely in the browser thanks to Pyodide and Netron · ☆20 · Updated 2 years ago
- AI Edge Quantizer: flexible post training quantization for LiteRT models. · ☆58 · Updated this week
- DiffQ performs differentiable quantization using pseudo quantization noise. It can automatically tune the number of bits used per weight … · ☆236 · Updated 2 years ago
- [DEPRECATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs · ☆17 · Updated 2 years ago
- Neural Network Libraries - C Runtime · ☆55 · Updated 2 weeks ago
- A ggml (C++) re-implementation of tortoise-tts · ☆188 · Updated 11 months ago
- A simple, hackable text-to-speech system in PyTorch and MLX · ☆170 · Updated last week
- Experiments with BitNet inference on CPU · ☆54 · Updated last year
- Lyra V2 WebAssembly build · ☆30 · Updated 10 months ago
- Low-latency ASR using SpeechBrain StreamingASR and torchaudio StreamReader. · ☆17 · Updated 3 months ago
- ☆16 · Updated 2 years ago
- onnxruntime-extensions: A specialized pre- and post- processing library for ONNX Runtime · ☆405 · Updated this week
- ☆16 · Updated last year
- Delightful WebNN resources, a curated list of awesome things around the WebNN ecosystem.