rhasspy / pymicro-vadLinks
Self-contained voice activity detector
☆36Updated last month
Alternatives and similar repositories for pymicro-vad
Users that are interested in pymicro-vad are comparing it to the libraries listed below
Sorting:
- C++ version of openWakeWord☆40Updated last year
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆72Updated last year
- On-device voice activity detection (VAD) powered by deep learning☆241Updated last week
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆29Updated last year
- Kroko ASR - Speech-to-text☆130Updated 3 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆29Updated 4 months ago
- On-device streaming text-to-speech engine powered by deep learning☆127Updated last week
- A curated list of awesome voice activity detection☆71Updated last year
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆53Updated 10 months ago
- C++ library for converting text to phonemes for Piper☆137Updated 6 months ago
- On-device noise suppression powered by deep learning☆81Updated last week
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆73Updated 6 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆100Updated last year
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆35Updated 5 months ago
- Experiments to test different speech recognition systems for SEPIA Framework☆62Updated 2 years ago
- Open models for Coqui STT☆150Updated 2 years ago
- ☆29Updated 2 years ago
- ☆171Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆68Updated 2 years ago
- ☆54Updated last week
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆200Updated 11 months ago
- Very fast, accurate speaker diarization☆222Updated 3 weeks ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆31Updated last year
- On-device speaker recognition engine powered by deep learning☆39Updated last week
- Faster Whisper ASR transcription with CTranslate2☆24Updated last year
- An even smaller speech recognizer / force aligner☆37Updated last year
- On-device speaker diarization powered by deep learning☆63Updated last week
- SEPIA server to support open-source speech recognition via WebSocket connection.☆135Updated last year
- All public LiveKit repos as a common repo to make searching and LLM inference easier.☆26Updated last month
- A smartphone applications with Convolutional Neural Network Voice Activity Detector, Adaptive Noise Reduction and Dynamic Audio Range Com…☆20Updated 6 years ago