Open models for Coqui STT
β155May 9, 2023Updated 3 years ago
Alternatives and similar repositories for STT-models
Users that are interested in STT-models are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A Text-To-Speech Model Developed Using πΈSTTβ13Jun 22, 2022Updated 3 years ago
- Linguistic processing for Common Voiceβ59Jan 18, 2024Updated 2 years ago
- πΈSTT integration examplesβ132Sep 23, 2022Updated 3 years ago
- π« check your data, before you wreck your modelβ16Aug 11, 2022Updated 3 years ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,587Mar 11, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Coqui Inference Engineβ41Aug 3, 2021Updated 4 years ago
- πΈTTS recipes for different datasetsβ89Jul 26, 2022Updated 3 years ago
- πΈ - A general purpose model trainer, as flexible as it getsβ234Mar 7, 2024Updated 2 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.β13Feb 13, 2021Updated 5 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.β30Jun 8, 2021Updated 5 years ago
- Using YouTube to prepare a speech recognition dataset for any languageβ10Mar 30, 2021Updated 5 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ27Mar 24, 2023Updated 3 years ago
- This is an ASR corpus for Bemba language. It contains read speech from diverse publicly available Bemba sources; Literature Books, Radio/β¦β40Jul 31, 2025Updated 10 months ago
- πΈCoqui Dialogue Audio Pack contains more than 2000 audio files of synthetic human voices over dialogue created specifically for video gaβ¦β46Mar 7, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways β’ AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Turkish Speech Recognition using Facebook's Wav2vec 2.0 modelsβ32Feb 7, 2022Updated 4 years ago
- A library of speech gadgets.β15Oct 15, 2022Updated 3 years ago
- β22Jul 8, 2021Updated 4 years ago
- Golang bindings for Coqui's speech-to-text libraryβ34Aug 19, 2022Updated 3 years ago
- This repository provides data and code for "Vox Populi, Vox DIY: Benchmark Dataset for Crowdsourced Audio Transcription" paper.β16Jul 22, 2021Updated 4 years ago
- Java Bindings for the C++ library DeepSpeechβ10Jun 4, 2020Updated 6 years ago
- A voice driven 3D chess game for learning Voice AIβ17Jul 6, 2022Updated 3 years ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.β28Feb 15, 2024Updated 2 years ago
- β13Oct 27, 2021Updated 4 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Lite Voice Terminal, an "offline smart speaker" solution powered by on-premise ASR server (vosk API / kaldi engine)β19Feb 29, 2024Updated 2 years ago
- phonetic similarity algorithmsβ13Jun 19, 2018Updated 7 years ago
- Scraping Wikipedia for fair use sentencesβ54Jan 25, 2024Updated 2 years ago
- Source code for ASRU 2019 paper "Adapting Pretrained Transformer to Lattices for Spoken Language Understanding"β10Jul 8, 2020Updated 5 years ago
- This project builds a custom question answering chatbot using Langchain and Google Gemini Language Model (LLM). It fine-tunes industrial β¦β14Apr 2, 2024Updated 2 years ago
- Repository for multilingual speech data resources for native languages of Zambia.β22Oct 9, 2024Updated last year
- Script for bundling Common Voice (https://commonvoice.mozilla.org/) clips by languageβ11Apr 13, 2023Updated 3 years ago
- simple to use, pretrained/training-less models for speaker diarizationβ22Aug 23, 2023Updated 2 years ago
- DEVKIT V1 projects, BLE, WiFi and Robotics.β12Sep 30, 2025Updated 8 months ago
- Deploy on Railway without the complexity - Free Credits Offer β’ AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- REST api for mozilla deepspeech voice recognition engineβ20Nov 1, 2021Updated 4 years ago
- [APSIPA'22] Exploring Speaker Age Estimation on Different Self-Supervised Learning Modelsβ14Oct 19, 2022Updated 3 years ago
- DeepSpeech based forced alignment toolβ239Dec 12, 2020Updated 5 years ago
- On-device voice activity detection (VAD) powered by deep learningβ262Updated this week
- Create an LJSpeech structured voice dataset on wave inputβ37Sep 28, 2024Updated last year
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.β26Jul 25, 2024Updated last year
- β18Apr 28, 2021Updated 5 years ago