Open models for Coqui STT
☆153May 9, 2023Updated 2 years ago
Alternatives and similar repositories for STT-models
Users that are interested in STT-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Text-To-Speech Model Developed Using 🐸STT☆13Jun 22, 2022Updated 3 years ago
- TTS Client for Coqui TTS server☆13Jan 7, 2023Updated 3 years ago
- Linguistic processing for Common Voice☆58Jan 18, 2024Updated 2 years ago
- 🐸STT integration examples☆130Sep 23, 2022Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Aug 11, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Coqui Inference Engine☆40Aug 3, 2021Updated 4 years ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.☆2,577Mar 11, 2024Updated 2 years ago
- 🐸TTS recipes for different datasets☆86Jul 26, 2022Updated 3 years ago
- 🐸 - A general purpose model trainer, as flexible as it gets☆234Mar 7, 2024Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.☆30Jun 8, 2021Updated 4 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Mar 30, 2021Updated 4 years ago
- Completely free Text-to-Speech (TTS) models with excellent Turkish support and multilingual capabilities. No development, just a comprehe…☆15Jul 2, 2025Updated 8 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆26Mar 24, 2023Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/…☆38Jul 31, 2025Updated 7 months ago
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 models☆31Feb 7, 2022Updated 4 years ago
- 💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies☆1,390Jun 6, 2024Updated last year
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- 👄🇧🇷 Alinhamento fonético forçado em Português Brasileiro☆13Jul 18, 2025Updated 8 months ago
- ☆22Jul 8, 2021Updated 4 years ago
- Golang bindings for Coqui's speech-to-text library☆34Aug 19, 2022Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.☆16Jul 22, 2021Updated 4 years ago
- Java Bindings for the C++ library DeepSpeech☆10Jun 4, 2020Updated 5 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A voice driven 3D chess game for learning Voice AI☆17Jul 6, 2022Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Feb 15, 2024Updated 2 years ago
- ☆13Oct 27, 2021Updated 4 years ago
- Source code for 'Transfer Learning for Speech Recognition on a Budget' published at ACL 2017☆46May 30, 2017Updated 8 years ago
- My favorite GNU/Linux flavor on the Microsoft Surface Duo.☆10Feb 7, 2024Updated 2 years ago
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)☆17Feb 29, 2024Updated 2 years ago
- Scraping Wikipedia for fair use sentences☆54Jan 25, 2024Updated 2 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"☆10Jul 8, 2020Updated 5 years ago
- This project builds a custom question answering chatbot using Langchain and Google Gemini Language Model (LLM). It fine-tunes industrial …☆14Apr 2, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- NeMo: a toolkit for conversational AI☆10Jan 18, 2023Updated 3 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Models☆14Oct 19, 2022Updated 3 years ago
- REST api for mozilla deepspeech voice recognition engine☆20Nov 1, 2021Updated 4 years ago
- DeepSpeech based forced alignment tool☆239Dec 12, 2020Updated 5 years ago
- On-device voice activity detection (VAD) powered by deep learning☆248Updated this week
- Create an LJSpeech structured voice dataset on wave input☆37Sep 28, 2024Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Jul 25, 2024Updated last year