rhasspy / pymicro-vadLinks
Self-contained voice activity detector
☆29Updated last year
Alternatives and similar repositories for pymicro-vad
Users that are interested in pymicro-vad are comparing it to the libraries listed below
Sorting:
- On-device streaming text-to-speech engine powered by deep learning☆121Updated this week
- C++ library for converting text to phonemes for Piper☆128Updated last month
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆23Updated last week
- Open models for Coqui STT☆141Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆223Updated this week
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆195Updated 5 months ago
- ☆150Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection.☆128Updated 9 months ago
- On-device noise suppression powered by deep learning☆73Updated this week
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆68Updated last year
- ☆27Updated 2 years ago
- A ggml (C++) re-implementation of tortoise-tts☆188Updated 11 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆68Updated 3 weeks ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- Port of Meta's Encodec in C/C++☆226Updated 8 months ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆47Updated 4 months ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆28Updated this week
- An API to transcribe audio with OpenAI's Whisper Large v3!☆296Updated 8 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- ☆32Updated last month
- Real-Time Whisper Voice Recognition with vosk model feedback.☆118Updated 2 years ago
- Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and …☆466Updated last month
- Open Audio Watermarking Tool☆237Updated last month
- Joint speech-language model - respond directly to audio!☆30Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- GGML implementation of BERT model with Python bindings and quantization.☆57Updated last year
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆49Updated last year
- ☆61Updated 11 months ago
- On-device speaker recognition engine powered by deep learning☆37Updated this week
- Speaker diarization model☆28Updated 2 years ago