Picovoice / pvrecorderLinks
Cross-platform audio recorder designed for real-time speech audio processing
☆116Updated 2 weeks ago
Alternatives and similar repositories for pvrecorder
Users that are interested in pvrecorder are comparing it to the libraries listed below
Sorting:
- Pybind11 bindings for Whisper.cpp☆336Updated 8 months ago
- On-device voice activity detection (VAD) powered by deep learning☆223Updated this week
- whisper.cpp bindings for python☆100Updated last year
- Python bindings for whisper.cpp☆279Updated this week
- Python bindings for whisper.cpp☆242Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆118Updated 2 years ago
- On-device streaming text-to-speech engine powered by deep learning☆121Updated this week
- 🐸STT integration examples☆130Updated 2 years ago
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆68Updated last year
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆96Updated last year
- streaming speech to text server using Whisper☆94Updated 2 years ago
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆68Updated 3 weeks ago
- Open models for Coqui STT☆141Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation☆120Updated last year
- TTS support with GGML☆143Updated 2 weeks ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆215Updated 9 months ago
- C++ library for converting text to phonemes for Piper☆128Updated last month
- A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection☆111Updated last year
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated 2 years ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆87Updated last year
- web based editor for subtitles and transcripts☆137Updated 11 months ago
- ez audio transcription tool with flexible processing and post-processing options☆156Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆505Updated last year
- faster-whisper as serverless endpoint☆112Updated 2 months ago
- A lightweight end-of-utterance detection model fine-tuned on SmolLM2-135M, optimized for Raspberry Pi and low-power devices.☆25Updated 4 months ago
- On-device speaker recognition engine powered by deep learning☆37Updated this week
- On-device noise suppression powered by deep learning☆73Updated 3 weeks ago
- A library for real-time voice processing in web browsers☆226Updated this week
- A speech recognition library running in the browser thanks to a WebAssembly build of Vosk☆474Updated last year