πΈSTT integration examples
β132Sep 23, 2022Updated 3 years ago
Alternatives and similar repositories for STT-examples
Users that are interested in STT-examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TTS Client for Coqui TTS serverβ13Jan 7, 2023Updated 3 years ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,583Mar 11, 2024Updated 2 years ago
- π Coqui's machine learning job schedulerβ31Sep 5, 2021Updated 4 years ago
- Coqui Inference Engineβ41Aug 3, 2021Updated 4 years ago
- πΈTTS recipes for different datasetsβ89Jul 26, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Evaluation of STT models for german languageβ15Jan 22, 2022Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.β13Feb 13, 2021Updated 5 years ago
- Open models for Coqui STTβ155May 9, 2023Updated 3 years ago
- Linguistic processing for Common Voiceβ59Jan 18, 2024Updated 2 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.β30Jun 8, 2021Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.β74Oct 9, 2020Updated 5 years ago
- A living document for all things Common Voice.β14Jun 24, 2024Updated last year
- πΈ collection of TTS papersβ728Jul 4, 2024Updated last year
- A voice driven 3D chess game for learning Voice AIβ17Jul 6, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Agile reading group that worksβ13Feb 2, 2022Updated 4 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,395Jun 6, 2024Updated last year
- Tooling for producing French dataset for Common Voiceβ101Jan 20, 2025Updated last year
- Mozilla Voice Community Playbookβ48May 21, 2024Updated 2 years ago
- π« check your data, before you wreck your modelβ16Aug 11, 2022Updated 3 years ago
- β55Jan 13, 2023Updated 3 years ago
- Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeechβ110Jan 19, 2022Updated 4 years ago
- Chinese-ASR built on kaldiβ14Jan 21, 2019Updated 7 years ago
- β14Jun 12, 2015Updated 10 years ago
- Virtual machines for every use case on DigitalOcean β’ AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- scipts for working with open.bible dataβ26Jan 24, 2022Updated 4 years ago
- A library of speech gadgets.β14Oct 15, 2022Updated 3 years ago
- A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.htmlβ28Mar 17, 2026Updated 2 months ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptionsβ52Apr 1, 2021Updated 5 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data".β11Sep 18, 2023Updated 2 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variantβ10Aug 12, 2019Updated 6 years ago
- Scraping Wikipedia for fair use sentencesβ54Jan 25, 2024Updated 2 years ago
- A Python library for working with and comparing language codes.β29Feb 26, 2026Updated 3 months ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeβ16Mar 26, 2022Updated 4 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer β’ AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A VR180 photo viewer that works on a web browser.β11May 18, 2019Updated 7 years ago
- Ultrafast GAN based Vocoder for Text to Speechβ50Jul 16, 2022Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learningβ253May 11, 2026Updated 2 weeks ago
- Wireless Codec2 compressed audio transport over ESPNow using ESP32 micro controller.β14Mar 29, 2025Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Mar 18, 2019Updated 7 years ago
- WebUI for Whisper APIβ36Sep 14, 2024Updated last year
- REST api for mozilla deepspeech voice recognition engineβ20Nov 1, 2021Updated 4 years ago