Picovoice / pvrecorder
Cross-platform audio recorder designed for real-time speech audio processing
☆106Updated this week
Alternatives and similar repositories for pvrecorder:
Users that are interested in pvrecorder are comparing it to the libraries listed below
- On-device voice activity detection (VAD) powered by deep learning☆206Updated this week
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 11 months ago
- On-device noise suppression powered by deep learning☆69Updated this week
- faster-whisper as serverless endpoint☆95Updated this week
- whisper-cpp-serve Real-time speech recognition and c+ of OpenAI's Whisper model in C/C++☆62Updated 11 months ago
- 💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.☆207Updated 5 months ago
- On-device streaming text-to-speech engine powered by deep learning☆76Updated this week
- Python bindings for whisper.cpp☆242Updated last week
- Pybind11 bindings for Whisper.cpp☆328Updated 4 months ago
- On-device speaker recognition engine powered by deep learning☆34Updated this week
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆321Updated 5 months ago
- FastAPI service on top of WhisperX☆83Updated last week
- ez audio transcription tool with flexible processing and post-processing options☆149Updated last year
- whisper.cpp bindings for python☆94Updated last year
- Rust bindings to https://github.com/k2-fsa/sherpa-onnx☆160Updated 3 weeks ago
- Port of Microsoft's BioGPT in C/C++ using ggml☆88Updated last year
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- Demo python script app to interact with llama.cpp server using whisper API, microphone and webcam devices.☆46Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆205Updated last week
- Pybind11 bindings for Whisper.cpp☆55Updated 2 weeks ago
- Experiments to test different speech recognition systems for SEPIA Framework☆60Updated last year
- Faster Whisper ASR transcription with CTranslate2☆20Updated 5 months ago
- ONNX Inference of Pyannote Segmentation☆85Updated 3 months ago
- C++ library for converting text to phonemes for Piper☆115Updated last year
- Python bindings for whisper.cpp☆234Updated 10 months ago
- Voice activity detection (VAD) library, based on WebRTC's VAD engine built to WASM with Emscripten to run in browsers, Node, and NativeSc…☆29Updated 9 months ago
- Wake word detection engine based on Snips Personal Wakeword Detector☆54Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆62Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆158Updated 9 months ago