coqui-ai/STT-models

Open models for Coqui STT

☆153

Alternatives and similar repositories for STT-models

Users that are interested in STT-models are comparing it to the libraries listed below.

thorstenMueller / cTTS
TTS Client for Coqui TTS server
☆13 • Jan 7, 2023 • Updated 3 years ago
Oct4Pie / persian-stt
A Text-To-Speech Model Developed Using 🐸STT
☆13 • Jun 22, 2022 • Updated 4 years ago
ftyers / commonvoice-utils
Linguistic processing for Common Voice
☆59 • Jan 18, 2024 • Updated 2 years ago
coqui-ai / data-checker
🫠 check your data, before you wreck your model
☆16 • Aug 11, 2022 • Updated 3 years ago
coqui-ai / STT-examples
🐸STT integration examples
☆132 • Sep 23, 2022 • Updated 3 years ago
coqui-ai / STT
🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
☆2,596 • Mar 11, 2024 • Updated 2 years ago
coqui-ai / inference-engine
Coqui Inference Engine
☆41 • Aug 3, 2021 • Updated 4 years ago
coqui-ai / TTS-recipes
🐸TTS recipes for different datasets
☆88 • Jul 26, 2022 • Updated 4 years ago
coqui-ai / Trainer
🐸 - A general purpose model trainer, as flexible as it gets
☆233 • Mar 7, 2024 • Updated 2 years ago
coqui-ai / stt-model-manager
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
☆26 • Mar 24, 2023 • Updated 3 years ago
desh2608 / kaldi-noise-vectors
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13 • Feb 13, 2021 • Updated 5 years ago
TehreemFarooqi / Preparing-a-speech-recognition-dataset-using-YouTube-videos
Using YouTube to prepare a speech recognition dataset for any language
☆10 • Mar 30, 2021 • Updated 5 years ago
solyarisoftware / CoquiSTTJs
Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.
☆30 • Jun 8, 2021 • Updated 5 years ago
csikasote / BembaSpeech
This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…
☆41 • Jul 31, 2025 • Updated 11 months ago
abhi227070 / Custom-Question-Answering-Chatbot-using-Langchain-and-Gemini-AI
This project builds a custom question answering chatbot using Langchain and Google Gemini Language Model (LLM). It fine-tunes industrial …
☆14 • Apr 2, 2024 • Updated 2 years ago
coqui-ai / coqui-voice-pack
🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga…
☆46 • Mar 7, 2023 • Updated 3 years ago
mpoyraz / wav2vec2-turkish
Turkish Speech Recognition using Facebook's Wav2vec 2.0 models
☆33 • Feb 7, 2022 • Updated 4 years ago
falabrasil / ufpalign
👄🇧🇷 Forced phonetic alignment in Brazilian Portuguese
☆13 • Jul 18, 2025 • Updated last year
SpeechColab / PySpeechColab
A library of speech gadgets.
☆15 • Oct 15, 2022 • Updated 3 years ago
pilot7747 / VoxDIY
This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.
☆16 • Jul 22, 2021 • Updated 5 years ago
mikex86 / DeepSpeech-Java-Bindings
Java Bindings for the C++ library DeepSpeech
☆10 • Jun 4, 2020 • Updated 6 years ago
Picovoice / cobra
On-device voice activity detection (VAD) powered by deep learning
☆266 • Updated this week
caixxiong / espeak-data
☆15 • Dec 12, 2019 • Updated 6 years ago
egorsmkv / asr-corpus-creator
This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.
☆27 • Feb 15, 2024 • Updated 2 years ago
yuhangear / wenet-android
☆13 • Oct 27, 2021 • Updated 4 years ago
BreckoEC / share-extension-expo-example
☆10 • Mar 8, 2023 • Updated 3 years ago
unza-speech-lab / zambezi-voice
Repository for multilingual speech data resources for native languages of Zambia.
☆22 • Oct 9, 2024 • Updated last year
transfer-learning-asr / transfer-learning-asr
Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017
☆46 • May 30, 2017 • Updated 9 years ago
mosave / LVTerminal
Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)
☆19 • Feb 29, 2024 • Updated 2 years ago
common-voice / cv-sentence-extractor
Scraping Wikipedia for fair use sentences
☆54 • Jan 25, 2024 • Updated 2 years ago
resemble-ai / NeMo
NeMo: a toolkit for conversational AI
☆10 • Jan 18, 2023 • Updated 3 years ago
MiuLab / Lattice-Transformer-SLU
Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"
☆10 • Jul 8, 2020 • Updated 6 years ago
crowsonkb / pharmacokinetics
A Flask web application to calculate and plot drug concentration over time.
☆15 • Jan 1, 2019 • Updated 7 years ago
zelo / deepspeech-rest-api
REST api for mozilla deepspeech voice recognition engine
☆20 • Nov 1, 2021 • Updated 4 years ago
bugbakery / pydiar
simple to use, pretrained/training-less models for speaker diarization
☆22 • Aug 23, 2023 • Updated 2 years ago
ductuantruong / speaker_age_estimation_ssl_study
[APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models
☆14 • Oct 19, 2022 • Updated 3 years ago
mozilla / DSAlign
DeepSpeech based forced alignment tool
☆239 • Dec 12, 2020 • Updated 5 years ago
kevindegila / flask-joey
A Simple Flask App to interact with your Machine Translation Model
☆13 • Feb 26, 2020 • Updated 6 years ago
thorstenMueller / Audio-to-Voice-Dataset
Create an LJSpeech structured voice dataset on wave input
☆38 • Sep 28, 2024 • Updated last year