SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2

☆24,518

Alternatives and similar repositories for faster-whisper

Users who are interested in faster-whisper are comparing it to the libraries listed below.

m-bain / whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,252 · Jul 13, 2026 · Updated last week
openai / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆105,550 · Apr 15, 2026 · Updated 3 months ago
ggml-org / whisper.cpp
Port of OpenAI's Whisper model in C/C++
☆52,290 · Jul 11, 2026 · Updated 2 weeks ago
snakers4 / silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,672 · Jul 16, 2026 · Updated last week
OpenNMT / CTranslate2
Fast inference engine for Transformer models
☆4,585 · Jul 3, 2026 · Updated 3 weeks ago
ufal / whisper_streaming
Whisper realtime streaming for long speech-to-text transcription and translation
☆3,656 · Nov 12, 2025 · Updated 8 months ago
collabora / WhisperLive
A nearly-live implementation of OpenAI's Whisper.
☆4,153 · Jul 17, 2026 · Updated last week
coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,809 · Aug 16, 2024 · Updated last year
Vaibhavs10 / insanely-fast-whisper
☆12,996 · Oct 25, 2025 · Updated 9 months ago
speaches-ai / speaches
☆3,540 · Updated this week
modelscope / FunASR
Open-source speech recognition toolkit for training, inference, streaming ASR, VAD, punctuation, speaker diarization pipelines, and OpenA…
☆19,467 · Updated this week
fishaudio / fish-speech
SOTA Open Source TTS
☆31,373 · Updated this week
sanchit-gandhi / whisper-jax
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
☆4,685 · Apr 3, 2024 · Updated 2 years ago
pyannote / pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,334 · Updated this week
huggingface / distil-whisper
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
☆4,097 · Jan 8, 2025 · Updated last year
suno-ai / bark
🔊 Text-Prompted Generative Audio Model
☆39,215 · Aug 19, 2024 · Updated last year
Softcatala / whisper-ctranslate2
Whisper command line client compatible with original OpenAI client based on CTranslate2.
☆1,333 · Feb 14, 2026 · Updated 5 months ago
QwenAudio / SenseVoice
Open-source SenseVoiceSmall model for Mandarin, Cantonese, English, Japanese, and Korean ASR, language ID, emotion recognition, and audio…
☆8,935 · Updated this week
ggml-org / llama.cpp
LLM inference in C/C++
☆121,564 · Updated this week
Purfview / whisper-standalone-win
Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.
☆3,127 · Nov 7, 2025 · Updated 8 months ago
MahmoudAshraf97 / whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
☆5,608 · Feb 23, 2026 · Updated 5 months ago
QwenAudio / CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
☆22,400 · May 25, 2026 · Updated 2 months ago
vllm-project / vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
☆87,138 · Updated this week
myshell-ai / OpenVoice
Instant voice cloning by MIT and MyShell. Audio foundation model.
☆37,020 · Apr 19, 2025 · Updated last year
open-webui / open-webui
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
☆146,695 · Updated this week
ollama / ollama
Get up and running with Kimi-K2.6, GLM-5.2, MiniMax, DeepSeek, gpt-oss, Qwen, Gemma and other models.
☆176,850 · Updated this week
Comfy-Org / ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
☆122,204 · Updated this week
BerriAI / litellm
The fastest, litest AI Gateway. Rust core with Python SDK. Call 100+ LLM APIs in OpenAI (or native) format with cost tracking, guardrails…
☆54,668 · Updated this week
chidiwilliams / buzz
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
☆20,385 · Updated this week
unslothai / unsloth
Unsloth is a local UI for training and running Gemma 4, Qwen3.6, DeepSeek, Kimi, GLM and other models.
☆68,878 · Updated this week
KoljaB / RealtimeSTT
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcri…
☆10,005 · Jun 12, 2026 · Updated last month
k2-fsa / sherpa-onnx
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime…
☆13,783 · Updated this week
RVC-Boss / GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
☆60,109 · Updated this week
oobabooga / textgen
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
☆47,489 · Jun 2, 2026 · Updated last month
run-llama / llama_index
LlamaIndex is the leading document agent and OCR platform
☆51,086 · Updated this week
2noise / ChatTTS
A generative speech model for daily dialogue.
☆39,678 · Apr 10, 2026 · Updated 3 months ago
linto-ai / whisper-timestamped
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
☆2,829 · Sep 9, 2025 · Updated 10 months ago
facebookresearch / seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
☆11,821 · Apr 8, 2026 · Updated 3 months ago
mem0ai / mem0
Universal memory layer for AI Agents
☆61,666 · Updated this week