PINTO0309 / whisper-onnx-cpuLinks
ONNX implementation of Whisper. PyTorch free.
☆101Updated 7 months ago
Alternatives and similar repositories for whisper-onnx-cpu
Users that are interested in whisper-onnx-cpu are comparing it to the libraries listed below
Sorting:
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- A project that optimizes Whisper for low latency inference using NVIDIA TensorRT☆86Updated 9 months ago
- Nue-ASR inference code by rinna Co., Ltd.☆35Updated 11 months ago
- A very simple tool for situations where optimization with onnx-simplifier would exceed the Protocol Buffers upper file size limit of 2GB,…☆17Updated last year
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆115Updated last month
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- mnn asr demo.☆22Updated 3 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆67Updated last week
- A playground for experimenting with acoustic echo cancellation using a microphone, speaker, and ONNX.☆10Updated 8 months ago
- openvino version of openai/whisper☆168Updated last year
- Simple tool for partial optimization of ONNX. Further optimize some models that cannot be optimized with onnx-optimizer and onnxsim by se…☆19Updated last year
- A high-throughput and memory-efficient inference and serving engine for Whisper, https://mesolitica.com/blog/vllm-whisper☆28Updated 11 months ago
- Port of Funasr's Paraformer model in C/C++☆32Updated last year
- Fine-tuning Moshi/J-Moshi on your own spoken dialogue data☆59Updated 3 months ago
- Utilizes ONNX Runtime to transcribe audio into text.☆39Updated last week
- ONNX Inference of Pyannote Segmentation☆92Updated 6 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆100Updated 9 months ago
- PaddleOCRのPythonでのONNX推論サンプル☆43Updated 2 years ago
- ☆47Updated 11 months ago
- 44100Hz日本語音源に対応した MB-iSTFT-VITS: Lightweight and High-Fidelity End-to-End Text-to-Speech with Multi-Band Generation and Inverse Short-Tim…☆37Updated 2 years ago
- ☆56Updated 8 months ago
- Experiments with BitNet inference on CPU☆54Updated last year
- A lightweight end-to-end text-to-speech model☆115Updated 4 months ago
- TTS support with GGML☆127Updated 2 weeks ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆103Updated 2 years ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆42Updated 10 months ago
- A real-time implementation of Voice Activity Projection (VAP) is aimed at controlling behaviors of spoken dialogue systems, such as turn-…☆83Updated last week
- ncnn HiFi-GAN☆26Updated 9 months ago
- Running the F5-TTS by ONNX Runtime☆161Updated last week