coqui-ai/STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

☆2,591

Alternatives and similar repositories for STT

Users who are interested in STT are comparing it to the libraries listed below.

coqui-ai / STT-examples
🐸STT integration examples
☆132 · Sep 23, 2022 · Updated 3 years ago
coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,750 · Aug 16, 2024 · Updated last year
mozilla / DeepSpeech
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Ras…
☆26,770 · Jun 19, 2025 · Updated last year
speechbrain / speechbrain
A PyTorch-based Speech Toolkit
☆11,683 · Jun 15, 2026 · Updated last month
coqui-ai / open-speech-corpora
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
☆1,397 · Jun 6, 2024 · Updated 2 years ago
coqui-ai / snakepit
🐍 Coqui's machine learning job scheduler
☆31 · Sep 5, 2021 · Updated 4 years ago
mozilla / TTS
Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
☆10,161 · Nov 9, 2023 · Updated 2 years ago
alphacep / vosk-api
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
☆14,937 · Jul 2, 2026 · Updated last week
coqui-ai / STT-models
Open models for Coqui STT
☆153 · May 9, 2023 · Updated 3 years ago
kaldi-asr / kaldi
kaldi-asr/kaldi is the official location of the Kaldi project.
☆15,425 · Sep 22, 2025 · Updated 9 months ago
espnet / espnet
End-to-End Speech Processing Toolkit
☆9,889 · Updated this week
flashlight / wav2letter
Facebook AI Research's Automatic Speech Recognition Toolkit
☆6,440 · Updated this week
NVIDIA-NeMo / Speech
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Auto…
☆17,770 · Updated this week
snakers4 / silero-models
Silero Models: pre-trained text-to-speech models made embarrassingly simple
☆6,011 · Jun 4, 2026 · Updated last month
openai / whisper
Robust Speech Recognition via Large-Scale Weak Supervision
☆104,926 · Apr 15, 2026 · Updated 3 months ago
coqui-ai / TTS-papers
🐸 collection of TTS papers
☆731 · Jul 4, 2024 · Updated 2 years ago
coqui-ai / TTS-recipes
🐸TTS recipes for different datasets
☆88 · Jul 26, 2022 · Updated 3 years ago
lhotse-speech / lhotse
Tools for handling multimodal data in machine learning projects.
☆1,141 · Jun 22, 2026 · Updated 3 weeks ago
coqui-ai / inference-engine
Coqui Inference Engine
☆41 · Aug 3, 2021 · Updated 4 years ago
pyannote / pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker…
☆10,277 · Updated this week
huggingface / speechbox
☆358 · Mar 17, 2024 · Updated 2 years ago
ggml-org / whisper.cpp
Port of OpenAI's Whisper model in C/C++
☆51,802 · Updated this week
SpeechColab / GigaSpeech
Large, modern dataset for speech recognition
☆730 · Feb 26, 2024 · Updated 2 years ago
TensorSpeech / TensorFlowASR
TensorFlowASR: Almost State-of-the-art Automatic Speech Recognition in Tensorflow 2. Supported languages that can use characters or subw…
☆1,009 · Jun 11, 2025 · Updated last year
coqui-ai / stt-model-manager
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
☆26 · Mar 24, 2023 · Updated 3 years ago
snakers4 / silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
☆9,584 · Jul 3, 2026 · Updated last week
SYSTRAN / faster-whisper
Faster Whisper transcription with CTranslate2
☆24,276 · Nov 19, 2025 · Updated 7 months ago
synesthesiam / coqui-docker
Docker images for Coqui AI
☆62 · Jul 5, 2021 · Updated 5 years ago
facebookresearch / fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
☆32,240 · Sep 30, 2025 · Updated 9 months ago
iceychris / LibreASR
An On-Premises, Streaming Speech Recognition System
☆679 · Nov 28, 2021 · Updated 4 years ago
thorstenMueller / Thorsten-Voice
Thorsten-Voice: A free to use, offline working, high quality german TTS voice should be available for every project without any license s…
☆722 · Jul 3, 2026 · Updated last week
flashlight / flashlight
A C++ standalone library for machine learning
☆5,445 · Jun 22, 2026 · Updated 3 weeks ago
asticode / go-asticoqui
Golang bindings for Coqui's speech-to-text library
☆34 · Aug 19, 2022 · Updated 3 years ago
syhw / wer_are_we
Attempt at tracking states of the arts and recent results (bibliography) on speech recognition.
☆1,864 · Jun 27, 2022 · Updated 4 years ago
m-bain / whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
☆23,066 · Updated this week
TensorSpeech / TensorFlowTTS
TensorFlowTTS: Real-Time State-of-the-art Speech Synthesis for Tensorflow 2 (supported including English, French, Korean, Chinese, Germa…
☆3,991 · Jul 5, 2024 · Updated 2 years ago
neonbjb / tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
☆14,860 · Nov 19, 2024 · Updated last year
freewym / espresso
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
☆939 · Sep 4, 2024 · Updated last year
coqui-ai / Trainer
🐸 - A general purpose model trainer, as flexible as it gets
☆233 · Mar 7, 2024 · Updated 2 years ago