shashikg/WhisperS2T

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/shashikg/WhisperS2T)

shashikg / WhisperS2T

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

☆577

Alternatives and similar repositories for WhisperS2T

Users that are interested in WhisperS2T are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

BBC-Esq / WhisperS2T-transcriber
View on GitHub
Uses the powerful WhisperS2T and Ctranslate2 libraries to batch transcribe multiple files
☆78Jun 20, 2026Updated last month
PINTO0309 / whisper-onnx-tensorrt
View on GitHub
ONNX and TensorRT implementation of Whisper
☆69May 27, 2023Updated 3 years ago
nyrahealth / CrisperWhisper
View on GitHub
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
☆1,031Updated this week
Vaibhavs10 / insanely-fast-whisper
View on GitHub
☆12,997Oct 25, 2025Updated 9 months ago
MahmoudAshraf97 / whisper-diarization
View on GitHub
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
☆5,614Feb 23, 2026Updated 5 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
m-bain / whisperX
View on GitHub
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,320Jul 13, 2026Updated 2 weeks ago
OpenNMT / CTranslate2
View on GitHub
Fast inference engine for Transformer models
☆4,596Jul 3, 2026Updated 3 weeks ago
EtienneAb3d / WhisperHallu
View on GitHub
Experimental code: sound file preprocessing to optimize Whisper transcriptions without hallucinated texts
☆351Nov 12, 2024Updated last year
SYSTRAN / faster-whisper
View on GitHub
Faster Whisper transcription with CTranslate2
☆24,609Nov 19, 2025Updated 8 months ago
ufal / whisper_streaming
View on GitHub
Whisper realtime streaming for long speech-to-text transcription and translation
☆3,657Nov 12, 2025Updated 8 months ago
collabora / WhisperLive
View on GitHub
A nearly-live implementation of OpenAI's Whisper.
☆4,192Updated this week
huggingface / distil-whisper
View on GitHub
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
☆4,099Jan 8, 2025Updated last year
Wordcab / wordcab-transcribe
View on GitHub
💬 ASR FastAPI server using faster-whisper and Multi-Scale Auto-Tuning Spectral Clustering for diarization.
☆219Oct 30, 2024Updated last year
akashmjn / tinydiarize
View on GitHub
Minimal extension of OpenAI's Whisper adding speaker diarization with special tokens
☆549Nov 6, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
linto-ai / whisper-timestamped
View on GitHub
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
☆2,832Sep 9, 2025Updated 10 months ago
alphacep / whisper-prompts
View on GitHub
OpenAI Whisper Prompt Examples
☆53Jul 17, 2023Updated 3 years ago
juanmc2005 / diart
View on GitHub
A python package to build AI-powered real-time audio applications
☆2,007Jun 19, 2026Updated last month
mobiusml / faster-whisper
View on GitHub
Faster Whisper ASR transcription with CTranslate2
☆25Oct 25, 2024Updated last year
Softcatala / whisper-ctranslate2
View on GitHub
Whisper command line client compatible with original OpenAI client based on CTranslate2.
☆1,332Feb 14, 2026Updated 5 months ago
MiniXC / LightningFastSpeech2
View on GitHub
☆55Jan 13, 2023Updated 3 years ago
ex3ndr / supervoice-gpt
View on GitHub
GPT-style network for phonemization with durations of text
☆68Mar 21, 2024Updated 2 years ago
pyannote / pyannote-audio
View on GitHub
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,351Updated this week
huggingface / speechbox
View on GitHub
☆358Mar 17, 2024Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
csukuangfj / kaldi_native_io
View on GitHub
python wrapper for kaldi's native I/O
☆27Jan 9, 2025Updated last year
jianfch / stable-ts
View on GitHub
Transcription, forced alignment, and audio indexing with OpenAI's Whisper
☆2,282May 30, 2026Updated last month
NeuralVox / OpenPhonemizer
View on GitHub
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆111Mar 15, 2026Updated 4 months ago
zjlww / dsp
View on GitHub
Digital Speech Processing in PyTorch.
☆15Aug 12, 2022Updated 3 years ago
sanchit-gandhi / whisper-jax
View on GitHub
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
☆4,685Apr 3, 2024Updated 2 years ago
WhisperSpeech / WhisperSpeech
View on GitHub
An Open Source text-to-speech system built by inverting Whisper.
☆4,625Dec 14, 2025Updated 7 months ago
aiola-lab / whisper-medusa
View on GitHub
Whisper with Medusa heads
☆861Jul 2, 2026Updated 3 weeks ago
Vaibhavs10 / optimise-my-whisper
View on GitHub
☆207May 27, 2024Updated 2 years ago
snakers4 / silero-vad
View on GitHub
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,791Jul 16, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
speaches-ai / speaches
View on GitHub
☆3,548Updated this week
knoriy / CLARA
View on GitHub
☆62Jul 25, 2024Updated 2 years ago
google / speaker-id
View on GitHub
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at…
☆453Aug 12, 2025Updated 11 months ago
KoljaB / Linguflex
View on GitHub
Command Your World with Voice
☆812Jun 17, 2025Updated last year
NVIDIA-AI-IOT / whisper_trt
View on GitHub
A project that optimizes Whisper for low latency inference using NVIDIA TensorRT
☆110Oct 15, 2024Updated last year
collabora / WhisperFusion
View on GitHub
WhisperFusion builds upon the capabilities of WhisperLive and WhisperSpeech to provide a seamless conversations with an AI.
☆1,647Jul 31, 2024Updated last year
vasistalodagala / whisper-finetune
View on GitHub
Fine-tune and evaluate Whisper models for Automatic Speech Recognition (ASR) on custom datasets or datasets from huggingface.
☆365May 23, 2023Updated 3 years ago