rhasspy / pymicro-vadLinks
Self-contained voice activity detector
☆30Updated last year
Alternatives and similar repositories for pymicro-vad
Users that are interested in pymicro-vad are comparing it to the libraries listed below
Sorting:
- C++ library for converting text to phonemes for Piper☆132Updated last month
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆70Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆120Updated 3 weeks ago
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 9 months ago
- A curated list of awesome voice activity detection☆62Updated 9 months ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆47Updated 5 months ago
- On-device voice activity detection (VAD) powered by deep learning☆228Updated 3 weeks ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- Port of Meta's Encodec in C/C++☆224Updated 8 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆63Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆23Updated this week
- On-device noise suppression powered by deep learning☆74Updated 3 weeks ago
- C++ version of openWakeWord☆29Updated last year
- Open models for Coqui STT☆141Updated 2 years ago
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆25Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆68Updated last month
- ☆34Updated 2 weeks ago
- ONNX Inference of Pyannote Segmentation☆93Updated 8 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- ☆154Updated last year
- ez audio transcription tool with flexible processing and post-processing options☆158Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference☆52Updated last year
- On-device speaker recognition engine powered by deep learning☆37Updated 3 weeks ago
- On-device speaker diarization powered by deep learning☆52Updated 3 weeks ago
- A ggml (C++) re-implementation of tortoise-tts☆187Updated last year
- streaming speech to text server using Whisper☆94Updated 2 years ago
- ☆16Updated 4 months ago
- ☆27Updated 2 years ago
- ☆22Updated 7 months ago