rhasspy / pymicro-vad
Self-contained voice activity detector
☆31 · Updated last year
Alternatives and similar repositories for pymicro-vad
Users who are interested in pymicro-vad are comparing it to the libraries listed below.
- C++ version of openWakeWord ☆31 · Updated last year
- On-device streaming text-to-speech engine powered by deep learning ☆122 · Updated last month
- SEPIA server to support open-source speech recognition via WebSocket connection. ☆131 · Updated 11 months ago
- whisper-cpp-serve: real-time speech recognition with OpenAI's Whisper model in C/C++ ☆71 · Updated last year
- On-device voice activity detection (VAD) powered by deep learning ☆229 · Updated 2 weeks ago
- C++ library for converting text to phonemes for Piper ☆134 · Updated 3 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ☆68 · Updated 2 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback. ☆119 · Updated 2 years ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ☆97 · Updated last year
- A curated list of awesome voice activity detection ☆66 · Updated 10 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ☆65 · Updated last year
- Open models for Coqui STT ☆144 · Updated 2 years ago
- ☆46 · Updated last week
- Experiments to test different speech recognition systems for SEPIA Framework ☆63 · Updated 2 years ago
- Very fast, accurate speaker diarization ☆145 · Updated last week
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen ☆49 · Updated 6 months ago
- On-device noise suppression powered by deep learning ☆74 · Updated 2 months ago
- streaming speech to text server using Whisper ☆95 · Updated 2 years ago
- On-device speaker recognition engine powered by deep learning ☆37 · Updated 2 months ago
- Port of Meta's Encodec in C/C++ ☆222 · Updated 10 months ago
- ☆156 · Updated last year
- Google Chrome Text to Speech command line client ☆34 · Updated 4 years ago
- All public LiveKit repos as a common repo to make searching and LLM inference easier. ☆15 · Updated last week
- A ggml (C++) re-implementation of tortoise-tts ☆189 · Updated last year
- On-device speaker diarization powered by deep learning ☆55 · Updated 2 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port… ☆25 · Updated last month
- Simple, energy-based voice activity detection algorithm implementation. ☆17 · Updated last year
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition" ☆198 · Updated 7 months ago
- ☆28 · Updated last month
- ☆27 · Updated 2 years ago