QuentinFuxa / whisper_streaming_web

Whisper Streaming with Websocket and Fastapi server

☆87

Alternatives and similar repositories for whisper_streaming_web:

Users that are interested in whisper_streaming_web are comparing it to the libraries listed below

nyrahealth / CrisperWhisper
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
☆584Updated 2 months ago
alesaccoia / VoiceStreamAI
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
☆812Updated 4 months ago
speaches-ai / speaches
☆1,402Updated this week
ufal / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆2,471Updated last month
pavelzbornik / whisperX-FastAPI
FastAPI service on top of WhisperX
☆68Updated 3 weeks ago
gaborvecsei / whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
☆173Updated 8 months ago
revdotcom / reverb
Open source inference code for Rev's model
☆377Updated last month
collabora / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
☆2,448Updated 2 weeks ago
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
☆1,186Updated last week
coqui-ai / xtts-streaming-server
☆312Updated 7 months ago
luweigen / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆111Updated last year
reriiasu / speech-to-text
Real-time transcription using faster-whisper
☆448Updated 6 months ago
jim60105 / docker-whisperX
Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …
☆225Updated 2 weeks ago
shashikg / WhisperS2T
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine
☆360Updated 5 months ago
edwko / OuteTTS
Interface for OuteTTS models.
☆926Updated last week
absadiki / pywhispercpp
Python bindings for whisper.cpp
☆221Updated this week
facebookresearch / spiritlm
Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".
☆879Updated 3 months ago
NavodPeiris / speechlib
speechlib is a library that can do speaker diarization, transcription and speaker recognition on an audio file to create transcripts with…
☆190Updated last week
aiola-lab / whisper-medusa
Whisper with Medusa heads
☆822Updated last week
astramind-ai / Auralis
A Fast TTS Engine
☆451Updated 3 weeks ago
Softcatala / open-dubbing
Open dubbing is an AI dubbing system which uses machine learning models to automatically translate and synchronize audio dialogue into di…
☆158Updated last week
EtienneAb3d / WhisperHallu
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆306Updated 3 months ago
nalbion / whisper-server
streaming speech to text server using Whisper
☆86Updated last year
yohasebe / whisper-stream
A bash script using OpenAI Whisper API for continuous audio transcription with automatic silence detection
☆102Updated 9 months ago
davabase / transcriber_app
Real time speech to text transcription app.
☆397Updated 2 years ago
hanifabd / voice-activity-detection-vad-realtime
Real-time Voice Activity Detection (VAD) with some example use case like simple voice bot and live transcription (realtime transcription)
☆65Updated 8 months ago
KoljaB / Linguflex
Command Your World with Voice
☆588Updated 2 months ago
lhl / voicechat2
Local SRT/LLM/TTS Voicechat
☆620Updated 4 months ago
hexgrad / kokoro
https://hf.co/hexgrad/Kokoro-82M
☆1,173Updated this week
nazdridoy / kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…
☆177Updated this week