moonshine-ai / onnx_shrink_ray
Shrinks ONNX files by quantizing large float constants into eight-bit equivalents.
★26 · Updated 3 weeks ago
Alternatives and similar repositories for onnx_shrink_ray
Users who are interested in onnx_shrink_ray compare it to the libraries listed below.
- Use safetensors with ONNX ★70 · Updated last week
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile… ★42 · Updated last year
- Port of Meta's Encodec in C/C++ ★225 · Updated 10 months ago
- ONNX implementation of Whisper. PyTorch free. ★99 · Updated 10 months ago
- [DEPRECATED] Multi-Instrumental Music Transformer trained on 12GB/400k MIDIs ★17 · Updated 3 years ago
- Lyra V2 (SoundStream) running in the browser ★18 · Updated 2 years ago
- ONNX and TensorRT implementation of Whisper ★64 · Updated 2 years ago
- A fast MP3 decoder for python, using minimp3 ★30 · Updated 3 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen ★49 · Updated 6 months ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT ★90 · Updated 11 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization an ONNX runtime port… ★24 · Updated last month
- Model compression for ONNX ★97 · Updated 10 months ago
- Profile your CoreML models directly from Python ★28 · Updated last month
- Lyra V2 WebAssembly build ★30 · Updated last year
- ONNX Script editor & visualiser running completely in the browser thanks to Pyodide and Netron ★20 · Updated 2 years ago
- Acoustic Neighbor Embeddings ★28 · Updated 2 months ago
- Experiments with BitNet inference on CPU ★54 · Updated last year
- AI Edge Quantizer: flexible post training quantization for LiteRT models. ★68 · Updated this week
- openvino version of openai/whisper ★175 · Updated last year
- TTS support with GGML ★180 · Updated this week
- trying to make WebGPU a bit easier to use ★17 · Updated last year
- Using large language models to maintain AI_CHANGELOG.md ★14 · Updated last year
- Export an ONNX graph that performs ISTFT. Designed for TTS models. ★27 · Updated last year
- zero-shot realtime TTS system, fully offline, free and open source ★46 · Updated 5 months ago
- ★17 · Updated last year
- A ggml (C++) re-implementation of tortoise-tts ★190 · Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper ★31 · Updated last year
- Tracking states of the arts and recent results (bibliography) on sound tasks. ★32 · Updated 2 years ago
- Delightful WebNN resources, curated list of awesome things around WebNN ecosystem. ★67 · Updated 3 months ago
- A fork of Lyra V2 (a low-bitrate neural audio codec) that supports a webassembly build. ★30 · Updated 2 years ago