appvoid / vosper
Real-Time Whisper Voice Recognition with vosk model feedback.
★119 · Updated 2 years ago
Alternatives and similar repositories for vosper
Users that are interested in vosper are comparing it to the libraries listed below
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines ★97 · Updated last year
- ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization. ★217 · Updated last year
- streaming speech to text server using Whisper ★95 · Updated 2 years ago
- openvino version of openai/whisper ★176 · Updated 2 years ago
- A curated list of awesome OpenAI's Whisper ★98 · Updated 2 years ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitles ★80 · Updated 2 years ago
- Whisper realtime streaming for long speech-to-text transcription and translation ★121 · Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google. ★67 · Updated last year
- Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription) ★101 · Updated 2 months ago
- web based editor for subtitles and transcripts ★141 · Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio. ★119 · Updated 2 years ago
- ez audio transcription tool with flexible processing and post-processing options ★159 · Updated last year
- Improving transcription performance of OpenAI Whisper for CPU based deployment ★254 · Updated 3 years ago
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with… ★239 · Updated 2 months ago
- whisper.cpp bindings for python ★107 · Updated 2 years ago
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks. ★71 · Updated 4 months ago
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper. ★137 · Updated 2 years ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts ★347 · Updated last year
- Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ★160 · Updated last year
- Open models for Coqui STT ★146 · Updated 2 years ago
- Efficient approach to speaker diarization using voice characteristics extraction ★104 · Updated 4 months ago
- A nearly-live implementation of OpenAI's Whisper, using sounddevice. Requires existing Whisper install. ★357 · Updated 3 months ago
- On-device voice activity detection (VAD) powered by deep learning ★233 · Updated last month
- ★350 · Updated last year
- SEPIA server to support open-source speech recognition via WebSocket connection. ★134 · Updated last year
- Live-Transcription (STT) with Whisper PoC ★200 · Updated last year
- TTS with The Massively Multilingual Speech (MMS) project ★230 · Updated last year
- On-device noise suppression powered by deep learning ★76 · Updated 3 months ago
- A testing repo to share code and thoughts on diarisation ★56 · Updated last year
- Pybind11 bindings for Whisper.cpp ★340 · Updated 11 months ago