rhasspy / pymicro-vad
Self-contained voice activity detector
☆33Updated 3 weeks ago
Alternatives and similar repositories for pymicro-vad
Users who are interested in pymicro-vad are comparing it to the libraries listed below.
- C++ version of openWakeWord☆34Updated last year
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆71Updated last year
- On-device streaming text-to-speech engine powered by deep learning☆122Updated last month
- SEPIA server to support open-source speech recognition via WebSocket connection.☆132Updated 11 months ago
- ☆27Updated 2 years ago
- On-device voice activity detection (VAD) powered by deep learning☆232Updated last month
- Self-hosted, high-quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆67Updated last year
- On-device noise suppression powered by deep learning☆76Updated 2 months ago
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆25Updated 2 months ago
- Official implementation of "WhisperNER: Unified Open Named Entity and Speech Recognition"☆196Updated 8 months ago
- On-device speaker recognition engine powered by deep learning☆37Updated 2 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆97Updated last year
- This public GitHub repository contains code for a fully self-hosted, on-premise transcription solution.☆54Updated 10 months ago
- ☆161Updated last year
- C++ library for converting text to phonemes for Piper☆134Updated 3 months ago
- All public LiveKit repos as a common repo to make searching and LLM inference easier.☆16Updated this week
- Experiments to test different speech recognition systems for SEPIA Framework☆63Updated 2 years ago
- Open models for Coqui STT☆146Updated 2 years ago
- A curated list of awesome voice activity detection☆67Updated 11 months ago
- ☆50Updated 2 weeks ago
- Real Time (WebRTC & WebTransport) Proxy for LLM WebSocket APIs☆42Updated 9 months ago
- C++17 port of Open-Unmix-PyTorch with streaming LSTM inference, ggml, quantization, and Eigen☆50Updated 7 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆119Updated 2 years ago
- streaming speech to text server using Whisper☆95Updated 2 years ago
- TTS-Wrapper makes it easier to use text-to-speech APIs by providing a unified and easy-to-use interface.☆32Updated 2 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆70Updated 3 months ago
- Tiny wrapper around webrtc-audio-processing for noise suppression/auto gain only☆29Updated last year
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆138Updated last year
- Port of Suno AI's Bark in C/C++ for fast inference☆52Updated last year
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆217Updated last year