rhasspy / pymicro-vadLinks
Self-contained voice activity detector
☆28Updated 10 months ago
Alternatives and similar repositories for pymicro-vad
Users that are interested in pymicro-vad are comparing it to the libraries listed below
Sorting:
- ☆139Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆64Updated last year
- Babylon.cpp is a C and C++ library for grapheme to phoneme conversion and text to speech synthesis. For phonemization a ONNX runtime port…☆21Updated 9 months ago
- Converts JSON-Schema to GBNF grammar to use with llama.cpp☆55Updated last year
- ☆26Updated 2 years ago
- Port of Suno AI's Bark in C/C++ for fast inference☆52Updated last year
- Faster Whisper ASR transcription with CTranslate2☆22Updated 8 months ago
- ☆22Updated this week
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆67Updated last year
- ☆57Updated 10 months ago
- LocalScore is an open benchmark which helps you understand how well your computer can handle local AI tasks.☆44Updated last week
- streaming speech to text server using Whisper☆93Updated 2 years ago
- A Javascript library (with Typescript types) to parse metadata of GGML based GGUF files.☆47Updated 10 months ago
- Joint speech-language model - respond directly to audio!☆30Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆119Updated last year
- 360M model running in the browser on WebGPU☆22Updated 10 months ago
- Vector functions and indexing for SQLite☆11Updated 2 years ago
- Extracts structured data from unstructured input. Programming language agnostic. Uses llama.cpp☆45Updated last year
- ☆21Updated 3 months ago
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆95Updated last year
- A curated list of awesome voice activity detection☆57Updated 7 months ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆63Updated last week
- Documentation for the Krixik Python client.☆38Updated 7 months ago
- The application performs real-time inference on audio from an ALSA capture device☆27Updated last week
- Website with current metrics on the fastest AI models.☆41Updated 7 months ago
- description: "An MCP server that enables LLMs to 'see' what's happening in browser-based games and applications through vectorized canv…☆34Updated 2 months ago
- Pybind11 bindings for Whisper.cpp☆58Updated 3 weeks ago
- On-device Speech-to-Index engine powered by deep learning☆36Updated 2 months ago
- On-device streaming text-to-speech engine powered by deep learning☆87Updated this week
- Embedding models from Jina AI☆60Updated last year