appvoid / vosperLinks
Real-Time Whisper Voice Recognition with vosk model feedback.
β112Updated last year
Alternatives and similar repositories for vosper
Users that are interested in vosper are comparing it to the libraries listed below
Sorting:
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelinesβ94Updated last year
- π¬ ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.β213Updated 7 months ago
- Pybind11 bindings for Whisper.cppβ330Updated 5 months ago
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated textsβ326Updated 6 months ago
- faster-whisper livestream translation, OBS noise reduction, dual language subtitlesβ78Updated 2 years ago
- whisper.cpp bindings for pythonβ96Updated last year
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.β64Updated last year
- streaming speech to text server using Whisperβ92Updated last year
- openvino version of openai/whisperβ166Updated last year
- A curated list of awesome OpenAI's Whisperβ101Updated last year
- Python bindings for whisper.cppβ258Updated this week
- Improving transcription performance of OpenAI Whisper for CPU based deploymentβ244Updated 2 years ago
- Simplified diarization pipeline using some pretrained models - audio file to diarized segments in a few lines of codeβ149Updated last year
- Efficient approach to speaker diarization using voice characteristics extractionβ94Updated last year
- On-device voice activity detection (VAD) powered by deep learningβ216Updated 3 weeks ago
- web based editor for subtitles and transcriptsβ133Updated 9 months ago
- π€ Nix-TTS: Lightweight and End-to-end Text-to-Speech via Module-wise Distillationβ253Updated last year
- Streaming transcriber with whisperβ686Updated 2 years ago
- A quick experiment to achieve almost realtime transcription using Whisper.β187Updated 2 years ago
- Listen to any audio stream on your machine and print out the transcribed or translated audio.β119Updated last year
- Whisper realtime streaming for long speech-to-text transcription and translationβ116Updated last year
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engineβ417Updated 9 months ago
- ez audio transcription tool with flexible processing and post-processing optionsβ150Updated last year
- β356Updated last year
- β156Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts withβ¦β217Updated last month
- Code and Pretrained Models for Interspeech 2023 Paper "Whisper-AT: Noise-Robust Automatic Speech Recognizers are Also Strong Audio Event β¦β384Updated last year
- Python bindings for whisper.cppβ235Updated last year
- β37Updated 2 years ago
- Accelerating faster-whisper single file processing by multiprocessing through parallelizationβ53Updated 2 years ago