πΈSTT integration examples
β130Sep 23, 2022Updated 3 years ago
Alternatives and similar repositories for STT-examples
Users that are interested in STT-examples are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TTS Client for Coqui TTS serverβ13Jan 7, 2023Updated 3 years ago
- πΈSTT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.β2,577Mar 11, 2024Updated 2 years ago
- π Coqui's machine learning job schedulerβ31Sep 5, 2021Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ26Mar 24, 2023Updated 3 years ago
- Coqui Inference Engineβ40Aug 3, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- πΈTTS recipes for different datasetsβ86Jul 26, 2022Updated 3 years ago
- Evaluation of STT models for german languageβ15Jan 22, 2022Updated 4 years ago
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.β13Feb 13, 2021Updated 5 years ago
- Open models for Coqui STTβ153May 9, 2023Updated 2 years ago
- Linguistic processing for Common Voiceβ58Jan 18, 2024Updated 2 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server.β30Jun 8, 2021Updated 4 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model.β74Oct 9, 2020Updated 5 years ago
- A living document for all things Common Voice.β14Jun 24, 2024Updated last year
- πΈ collection of TTS papersβ723Jul 4, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting β’ AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- A voice driven 3D chess game for learning Voice AIβ17Jul 6, 2022Updated 3 years ago
- Agile reading group that worksβ13Feb 2, 2022Updated 4 years ago
- π A list of accessible speech corpora for ASR, TTS, and other Speech Technologiesβ1,390Jun 6, 2024Updated last year
- Tooling for producing French dataset for Common Voiceβ101Jan 20, 2025Updated last year
- Mozilla Voice Community Playbookβ48May 21, 2024Updated last year
- β32Jan 6, 2022Updated 4 years ago
- π« check your data, before you wreck your modelβ16Aug 11, 2022Updated 3 years ago
- β55Jan 13, 2023Updated 3 years ago
- Android Speech Recognition Service using Vosk/Kaldi and Mozilla DeepSpeechβ109Jan 19, 2022Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient β’ AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Example workflow for our data-centric speech benchmarkβ17Jul 6, 2023Updated 2 years ago
- Chinese-ASR built on kaldiβ14Jan 21, 2019Updated 7 years ago
- β14Jun 12, 2015Updated 10 years ago
- scipts for working with open.bible dataβ26Jan 24, 2022Updated 4 years ago
- A library of speech gadgets.β14Oct 15, 2022Updated 3 years ago
- A proof-of-concept app using KeenASR SDK on Android. WE ARE HIRING: https://keenresearch.com/careers.htmlβ28Mar 17, 2026Updated last week
- Resources for "Simple Speech Representation Learning from Perceptual Data".β11Sep 18, 2023Updated 2 years ago
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variantβ10Aug 12, 2019Updated 6 years ago
- Scraping Wikipedia for fair use sentencesβ54Jan 25, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI β’ AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challengeβ15Mar 26, 2022Updated 4 years ago
- β15Sep 13, 2022Updated 3 years ago
- A VR180 photo viewer that works on a web browser.β11May 18, 2019Updated 6 years ago
- On-device voice activity detection (VAD) powered by deep learningβ248Updated this week
- My favorite GNU/Linux flavor on the Microsoft Surface Duo.β10Feb 7, 2024Updated 2 years ago
- Ultrafast GAN based Vocoder for Text to Speechβ50Jul 16, 2022Updated 3 years ago
- Wireless Codec2 compressed audio transport over ESPNow using ESP32 micro controller.β13Mar 29, 2025Updated 11 months ago