collabora/WhisperLive

collabora / WhisperLive

A nearly-live implementation of OpenAI's Whisper.

☆4,149

Alternatives and similar repositories for WhisperLive

Users who are interested in WhisperLive are comparing it to the libraries listed below.

ufal / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆3,653 · Nov 12, 2025 · Updated 8 months ago
SYSTRAN / faster-whisper
Faster Whisper transcription with CTranslate2
☆24,479 · Nov 19, 2025 · Updated 8 months ago
collabora / WhisperFusion
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide seamless conversations with an AI.
☆1,646 · Jul 31, 2024 · Updated last year
davabase / whisper_real_time
Real time transcription with OpenAI Whisper.
☆2,939 · Apr 15, 2025 · Updated last year
m-bain / whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,189 · Jul 13, 2026 · Updated last week
WhisperSpeech / WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
☆4,625 · Dec 14, 2025 · Updated 7 months ago
speaches-ai / speaches
☆3,538 · Updated this week
alesaccoia / VoiceStreamAI
Near-Realtime audio transcription using self-hosted Whisper and WebSocket in Python/JS
☆959 · Oct 2, 2024 · Updated last year
snakers4 / silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,655 · Jul 16, 2026 · Updated last week
huggingface / distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
☆4,096 · Jan 8, 2025 · Updated last year
juanmc2005 / diart
A python package to build AI-powered real-time audio applications
☆2,004 · Jun 19, 2026 · Updated last month
Vaibhavs10 / insanely-fast-whisper
☆12,993 · Oct 25, 2025 · Updated 8 months ago
ggml-org / whisper.cpp
Port of OpenAI's Whisper model in C/C++
☆52,218 · Jul 11, 2026 · Updated last week
KoljaB / RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…
☆10,000 · Jun 12, 2026 · Updated last month
MahmoudAshraf97 / whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
☆5,607 · Feb 23, 2026 · Updated 5 months ago
KoljaB / RealtimeTTS
Converts text to speech in realtime
☆3,997 · May 31, 2026 · Updated last month
pyannote / pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,319 · Updated this week
openai / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆105,466 · Apr 15, 2026 · Updated 3 months ago
reriiasu / speech-to-text
Real-time transcription using faster-whisper
☆614 · Jul 23, 2024 · Updated 2 years ago
OpenNMT / CTranslate2
Fast inference engine for Transformer models
☆4,582 · Jul 3, 2026 · Updated 2 weeks ago
linto-ai / whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
☆2,829 · Sep 9, 2025 · Updated 10 months ago
coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,796 · Aug 16, 2024 · Updated last year
gaborvecsei / whisper-live-transcription
Live-Transcription (STT) with Whisper PoC
☆201 · Jun 18, 2024 · Updated 2 years ago
Softcatala / whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
☆1,332 · Feb 14, 2026 · Updated 5 months ago
ahmetoner / whisper-asr-webservice
OpenAI Whisper ASR Webservice API
☆3,303 · Nov 23, 2025 · Updated 8 months ago
QuentinFuxa / WhisperLiveKit
Simultaneous speech-to-text models
☆10,556 · Updated this week
FunAudioLLM / SenseVoice
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,925 · Updated this week
sanchit-gandhi / whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
☆4,685 · Apr 3, 2024 · Updated 2 years ago
saharmor / whisper-playground
Build real time speech2text web apps using OpenAI's Whisper https://openai.com/blog/whisper/
☆833 · Sep 12, 2025 · Updated 10 months ago
fixie-ai / ultravox
A fast multimodal LLM for real-time voice
☆4,477 · Dec 12, 2025 · Updated 7 months ago
myshell-ai / OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆37,009 · Apr 19, 2025 · Updated last year
facebookresearch / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11,818 · Apr 8, 2026 · Updated 3 months ago
yl4579 / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,314 · Aug 10, 2024 · Updated last year
fishaudio / fish-speech
SOTA Open Source TTS
☆31,361 · Jun 9, 2026 · Updated last month
modelscope / FunASR
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,435 · Updated this week
pipecat-ai / pipecat
Open Source framework for voice and multimodal conversational AI
☆13,661 · Updated this week
k2-fsa / sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,735 · Updated this week
davabase / transcriber_app
Real time speech to text transcription app.
☆439 · Jan 14, 2023 · Updated 3 years ago
myshell-ai / MeloTTS
High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.
☆7,549 · Dec 24, 2024 · Updated last year