🐸STT integration examples
★132 · Sep 23, 2022 · Updated 3 years ago
Alternatives and similar repositories for STT-examples
Users who are interested in STT-examples are comparing it to the libraries listed below.
- TTS Client for Coqui TTS server ★13 · Jan 7, 2023 · Updated 3 years ago
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. ★2,583 · Mar 11, 2024 · Updated 2 years ago
- Coqui's machine learning job scheduler ★31 · Sep 5, 2021 · Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ★27 · Mar 24, 2023 · Updated 3 years ago
- Coqui Inference Engine ★41 · Aug 3, 2021 · Updated 4 years ago
- 🐸TTS recipes for different datasets ★89 · Jul 26, 2022 · Updated 3 years ago
- Evaluation of STT models for the German language ★15 · Jan 22, 2022 · Updated 4 years ago
- Implementation of different noise embeddings for noise-aware training of Kaldi acoustic models. ★13 · Feb 13, 2021 · Updated 5 years ago
- Open models for Coqui STT ★155 · May 9, 2023 · Updated 3 years ago
- Linguistic processing for Common Voice ★59 · Jan 18, 2024 · Updated 2 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server. ★30 · Jun 8, 2021 · Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ★74 · Oct 9, 2020 · Updated 5 years ago
- A living document for all things Common Voice. ★14 · Jun 24, 2024 · Updated last year
- 🐸 collection of TTS papers ★727 · Jul 4, 2024 · Updated last year
- A voice driven 3D chess game for learning Voice AI ★17 · Jul 6, 2022 · Updated 3 years ago
- Agile reading group that works ★13 · Feb 2, 2022 · Updated 4 years ago
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ★1,393 · Jun 6, 2024 · Updated last year
- Tooling for producing French dataset for Common Voice ★101 · Jan 20, 2025 · Updated last year
- Mozilla Voice Community Playbook ★48 · May 21, 2024 · Updated last year
- ★32 · Jan 6, 2022 · Updated 4 years ago
- check your data, before you wreck your model ★16 · Aug 11, 2022 · Updated 3 years ago
- ★55 · Jan 13, 2023 · Updated 3 years ago
- Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeech ★110 · Jan 19, 2022 · Updated 4 years ago
- Example workflow for our data-centric speech benchmark ★17 · Jul 6, 2023 · Updated 2 years ago
- Chinese-ASR built on kaldi ★14 · Jan 21, 2019 · Updated 7 years ago
- ★14 · Jun 12, 2015 · Updated 10 years ago
- scripts for working with open.bible data ★26 · Jan 24, 2022 · Updated 4 years ago
- A library of speech gadgets. ★14 · Oct 15, 2022 · Updated 3 years ago
- A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.html ★28 · Mar 17, 2026 · Updated last month
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions ★52 · Apr 1, 2021 · Updated 5 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data". ★11 · Sep 18, 2023 · Updated 2 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant ★10 · Aug 12, 2019 · Updated 6 years ago
- Scraping Wikipedia for fair use sentences ★54 · Jan 25, 2024 · Updated 2 years ago
- This repo contains the baseline model recipes and pre-trained model for GramVanni Hindi ASR challenge ★15 · Mar 26, 2022 · Updated 4 years ago
- ★15 · Sep 13, 2022 · Updated 3 years ago
- On-device voice activity detection (VAD) powered by deep learning ★250 · Apr 17, 2026 · Updated 3 weeks ago
- Ultrafast GAN based Vocoder for Text to Speech ★50 · Jul 16, 2022 · Updated 3 years ago
- Wireless Codec2 compressed audio transport over ESPNow using ESP32 micro controller. ★14 · Mar 29, 2025 · Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ★16 · Mar 18, 2019 · Updated 7 years ago