Open models for Coqui STT
☆155May 9, 2023Updated 3 years ago
Alternatives and similar repositories for STT-models
Users that are interested in STT-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- A Text-To-Speech Model Developed Using 🐸STT☆13Jun 22, 2022Updated 3 years ago
- Linguistic processing for Common Voice☆59Jan 18, 2024Updated 2 years ago
- 🐸STT integration examples☆132Sep 23, 2022Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,583Mar 11, 2024Updated 2 years ago
- Coqui Inference Engine☆41Aug 3, 2021Updated 4 years ago
- 🐸TTS recipes for different datasets☆89Jul 26, 2022Updated 3 years ago
- 🐸 - A general purpose model trainer, as flexible as it gets☆234Mar 7, 2024Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.☆30Jun 8, 2021Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 5 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆27Mar 24, 2023Updated 3 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆39Jul 31, 2025Updated 9 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 🐸Coqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video ga…☆46Mar 7, 2023Updated 3 years ago
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆32Feb 7, 2022Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆13Jul 18, 2025Updated 9 months ago
- ☆22Jul 8, 2021Updated 4 years ago
- Golang bindings for Coqui's speech-to-text library☆34Aug 19, 2022Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- Completely free Text-to-Speech (TTS) models with excellent Turkish support and multilingual capabilities. No development, just a comprehe…☆18Jul 2, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- A voice driven 3D chess game for learning Voice AI☆17Jul 6, 2022Updated 3 years ago
- Nendo plugin for MusicGen: A state-of-the-art controllable text-to-music model (by Meta Research)☆17Mar 19, 2024Updated 2 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46May 30, 2017Updated 8 years ago
- ☆10Mar 8, 2023Updated 3 years ago
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)☆17Feb 29, 2024Updated 2 years ago
- phonetic similarity algorithms☆13Jun 19, 2018Updated 7 years ago
- Scraping Wikipedia for fair use sentences☆54Jan 25, 2024Updated 2 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This project builds a custom question answering chatbot using Langchain and Google Gemini Language Model (LLM). It fine-tunes industrial …☆14Apr 2, 2024Updated 2 years ago
- NeMo: a toolkit for conversational AI☆10Jan 18, 2023Updated 3 years ago
- Repository for multilingual speech data resources for native languages of Zambia.☆20Oct 9, 2024Updated last year
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by language☆11Apr 13, 2023Updated 3 years ago
- A Flask web application to calculate and plot drug concentration over time.☆15Jan 1, 2019Updated 7 years ago
- REST api for mozilla deepspeech voice recognition engine☆20Nov 1, 2021Updated 4 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago