Open models for Coqui STT
☆155 May 9, 2023 Updated 3 years ago
Alternatives and similar repositories for STT-models
Users that are interested in STT-models are comparing it to the libraries listed below.
- TTS Client for Coqui TTS server ☆13 Jan 7, 2023 Updated 3 years ago
- A Text-To-Speech Model Developed Using 🐸STT ☆13 Jun 22, 2022 Updated 3 years ago
- Linguistic processing for Common Voice ☆59 Jan 18, 2024 Updated 2 years ago
- 🐸STT integration examples ☆132 Sep 23, 2022 Updated 3 years ago
- 🫠 check your data, before you wreck your model ☆16 Aug 11, 2022 Updated 3 years ago
- Coqui Inference Engine ☆41 Aug 3, 2021 Updated 4 years ago
- 🐸TTS recipes for different datasets ☆89 Jul 26, 2022 Updated 3 years ago
- 🐸 - A general purpose model trainer, as flexible as it gets ☆234 Mar 7, 2024 Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models. ☆13 Feb 13, 2021 Updated 5 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server. ☆30 Jun 8, 2021 Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language ☆10 Mar 30, 2021 Updated 5 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/… ☆39 Jul 31, 2025 Updated 9 months ago
- 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga… ☆46 Mar 7, 2023 Updated 3 years ago
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models ☆32 Feb 7, 2022 Updated 4 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ☆1,395 Jun 6, 2024 Updated last year
- A library of speech gadgets. ☆14 Oct 15, 2022 Updated 3 years ago
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro ☆13 Jul 18, 2025 Updated 10 months ago
- ☆22 Jul 8, 2021 Updated 4 years ago
- Golang bindings for Coqui's speech-to-text library ☆34 Aug 19, 2022 Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper. ☆16 Jul 22, 2021 Updated 4 years ago
- Java Bindings for the C++ library DeepSpeech ☆10 Jun 4, 2020 Updated 5 years ago
- Completely free Text-to-Speech (TTS) models with excellent Turkish support and multilingual capabilities. No development, just a comprehe… ☆22 Jul 2, 2025 Updated 10 months ago
- A voice driven 3D chess game for learning Voice AI ☆17 Jul 6, 2022 Updated 3 years ago
- Nendo plugin for MusicGen: A state-of-the-art controllable text-to-music model (by Meta Research) ☆17 Mar 19, 2024 Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Feb 15, 2024 Updated 2 years ago
- ☆13 Oct 27, 2021 Updated 4 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017 ☆46 May 30, 2017 Updated 8 years ago
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine) ☆19 Feb 29, 2024 Updated 2 years ago
- phonetic similarity algorithms ☆13 Jun 19, 2018 Updated 7 years ago
- This project builds a custom question answering chatbot using Langchain and Google Gemini Language Model (LLM). It fine-tunes industrial … ☆14 Apr 2, 2024 Updated 2 years ago
- NeMo: a toolkit for conversational AI ☆10 Jan 18, 2023 Updated 3 years ago
- Repository for multilingual speech data resources for native languages of Zambia. ☆21 Oct 9, 2024 Updated last year
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language ☆11 Apr 13, 2023 Updated 3 years ago
- simple to use, pretrained/training-less models for speaker diarization ☆22 Aug 23, 2023 Updated 2 years ago
- REST api for mozilla deepspeech voice recognition engine ☆20 Nov 1, 2021 Updated 4 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models ☆14 Oct 19, 2022 Updated 3 years ago
- DeepSpeech based forced alignment tool ☆239 Dec 12, 2020 Updated 5 years ago
- On-device voice activity detection (VAD) powered by deep learning ☆253 May 11, 2026 Updated 2 weeks ago
- Create an LJSpeech structured voice dataset on wave input ☆37 Sep 28, 2024 Updated last year