gumblex / whisper_vad
Whisper.cpp Speech-to-text with Voice Acticity Detection
☆16Updated 5 months ago
Alternatives and similar repositories for whisper_vad:
Users that are interested in whisper_vad are comparing it to the libraries listed below
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆21Updated 9 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- Uses the excellent silero VAD with onnxruntime C api for fast detection of audio segments with speech☆12Updated 7 months ago
- Python bindings of speexdsp noise suppression library☆38Updated 2 years ago
- ONNX-compatible Fast SeamlessM4T—Massively Multilingual & Multimodal Machine Translation☆43Updated last year
- A lightweight pure C++ Text-to-Speech (TTS) pipeline with OpenVINO, supporting multiple languages.☆52Updated last week
- An open source NLP as a service project focused on providing state of the art systems with ease. Training and inference by simple docker …☆20Updated 7 months ago
- XCORE-VOICE Solution☆14Updated last week
- a cpp ggml port of "VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech." for use in mobile…☆39Updated 8 months ago
- Port of Funasr's Paraformer model in C/C++☆32Updated 10 months ago
- Faster Whisper ASR transcription with CTranslate2☆20Updated 6 months ago
- SpeechDenoiser: Real-Time Speech Denoising with ONNX Welcome to SpeechDenoiser, a simple and effective solution for real-time speech den…☆72Updated 8 months ago
- ONNX and TensorRT implementation of Whisper☆61Updated last year
- BUD-E (Buddy) is an open-source voice assistant framework that facilitates seamless interaction with AI models and APIs, enabling the cre…☆19Updated 6 months ago
- ☆11Updated 3 years ago
- Accelerate Whisper tasks such as transcription, by multiprocesing through parallelization☆25Updated 2 years ago
- Simple, energy-based voice activity detection algorithm implementation.☆17Updated last year
- NNSE (Neural Network Speech Enhancement) is a speech-denoiser optimized to run on Ambiq's low power platform☆38Updated last week
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles☆78Updated last year
- Speaker diarization service☆21Updated last week
- zero-shot realtime TTS system, fully offline, free and open source☆34Updated last week
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation☆98Updated last week
- A fast MP3 decoder for python, using minimp3☆28Updated 2 years ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆63Updated 11 months ago
- ONNX implementation of Whisper. PyTorch free.☆94Updated 5 months ago
- Whisper combined with Silero VAD, for improved long-form transcriptions☆49Updated 2 years ago
- Open models for Coqui STT☆137Updated last year
- A client/server app for real‑time voice chat with AI. Live speech‑to‑text, instant AI replies.☆12Updated this week
- Robust Speech Recognition via Large-Scale Weak Supervision☆30Updated last year
- A cross platform implementation of Text-to-Speech based on ONNXRuntime.☆32Updated last year