gaborvecsei / whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
☆175Updated 9 months ago
Alternatives and similar repositories for whisper-live-transcription:
Users that are interested in whisper-live-transcription are comparing it to the libraries listed below
- Whisper realtime streaming for long speech-to-text transcription and translation☆113Updated last year
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆342Updated 9 months ago
- Whispering Tiger - OpenAI's whisper (and other models) with OSC and Websocket support. Allowing live transcription / translation in VRCha…☆426Updated this week
- Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS☆829Updated 5 months ago
- Speech Diarization for scrum automation☆102Updated last year
- Open source inference code for Rev's model☆383Updated 2 weeks ago
- An API to transcribe audio with OpenAI's Whisper Large v3!☆252Updated 4 months ago
- FastAPI service on top of WhisperX☆72Updated this week
- Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts☆312Updated 4 months ago
- Real-Time Whisper Voice Recognition with vosk model feedback.☆112Updated last year
- Record audio and save a transcription to your system's clipboard with ctranslate2 and faster-whisper.☆99Updated last month
- Speech recognition & diarisation solution with text alignment, deployed in AML pipelines☆94Updated 10 months ago
- Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection☆632Updated 3 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆92Updated 10 months ago
- ez audio transcription tool with flexible processing and post-processing options☆146Updated last year
- streaming speech to text server using Whisper☆90Updated last year
- The subtitles and translations are generated in real-time and displayed as pop-ups.☆152Updated last year
- speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…☆194Updated last month
- Have a natural voice conversation with an LLM☆243Updated 3 months ago
- web based editor for subtitles and transcripts☆126Updated 7 months ago
- An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine☆373Updated 6 months ago
- ☆36Updated 2 years ago
- Effortlessly record, transcribe, and summarize meetings with this user-friendly desktop utility powered by OpenAI's Whisper and GPT-3.5-t…☆175Updated last year
- Shush is an app that deploys a WhisperV3 model with Flash Attention v2 on Modal and makes requests to it via a NextJS app☆198Updated 9 months ago
- An JS web client for connecting to Pipecat bots with voice and vision☆43Updated 3 months ago
- OpenAI Whisper API-style local server, runnig on FastAPI☆75Updated 3 months ago
- ☆246Updated 2 years ago
- OpenAI API and Whisper based Video Translation☆72Updated 3 months ago
- Self hosted high quality voice recognition for de-googled Android using whisper. Like Siri or OK Google.☆60Updated last year
- Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens☆480Updated last year