gumblex / whisper_vadLinks
Whisper.cpp Speech-to-text with Voice Acticity Detection
☆19Updated 11 months ago
Alternatives and similar repositories for whisper_vad
Users that are interested in whisper_vad are comparing it to the libraries listed below
Sorting:
- Port of Funasr's Paraformer model in C/C++☆34Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆63Updated 2 years ago
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆42Updated last year
- ONNX and TensorRT implementation of Whisper☆64Updated 2 years ago
- GFPGAN face reconstruction with ncnn on a bare Raspberry Pi☆13Updated 2 years ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆49Updated 6 months ago
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆28Updated last year
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆16Updated last year
- Inference TinyLlama models on ncnn☆24Updated 2 years ago
- A fast MP3 decoder for python, using minimp3☆28Updated 3 years ago
- libvits-ncnn is an ncnn implementation of the VITS library that enables cross-platform GPU-accelerated speech synthesis.🎙️💻☆62Updated 2 years ago
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆42Updated 2 years ago
- C++ library for converting text to phonemes for Piper☆134Updated 3 months ago
- Port of Meta's Encodec in C/C++☆222Updated 10 months ago
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆82Updated 2 weeks ago
- Experiments with BitNet inference on CPU☆54Updated last year
- A lightweight end-to-end text-to-speech model☆120Updated 7 months ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆35Updated 7 months ago
- The CPP version of Silero VAD: pre-trained enterprise-grade Voice Activity Detector☆23Updated last year
- Onnx compatible styletts2 code☆13Updated 4 months ago
- LiveKit SDK for Embedded☆56Updated 11 months ago
- Web browser version of StarCoder.cpp☆44Updated 2 years ago
- NNSE (Neural Network Speech Enhancement) is a speech-denoiser optimized to run on Ambiq's low power platform☆41Updated 3 weeks ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆157Updated last year
- This is a repository that collects common audio noise reduction models, using Gradio to demonstrate the use of each model, which is very …☆46Updated 10 months ago
- Open models for Coqui STT☆144Updated 2 years ago
- ☆60Updated 3 weeks ago
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆103Updated last week
- openvino version of openai/whisper☆14Updated last year