🐸STT integration examples
★130 · Sep 23, 2022 · Updated 3 years ago
Alternatives and similar repositories for STT-examples
Users who are interested in STT-examples are comparing it to the repositories listed below.
- 🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy. ★2,567 · Mar 11, 2024 · Updated last year
- TTS Client for Coqui TTS server ★13 · Jan 7, 2023 · Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ★26 · Mar 24, 2023 · Updated 2 years ago
- Implementation of different noise embeddings for noise-aware training of Kaldi acoustic models. ★13 · Feb 13, 2021 · Updated 5 years ago
- Coqui's machine learning job scheduler ★31 · Sep 5, 2021 · Updated 4 years ago
- Coqui Inference Engine ★40 · Aug 3, 2021 · Updated 4 years ago
- Open models for Coqui STT ★153 · May 9, 2023 · Updated 2 years ago
- Evaluation of STT models for the German language ★15 · Jan 22, 2022 · Updated 4 years ago
- 🐸TTS recipes for different datasets ★86 · Jul 26, 2022 · Updated 3 years ago
- ★14 · Jun 12, 2015 · Updated 10 years ago
- Segment a given audio into utterances using a trained end-to-end ASR model. ★74 · Oct 9, 2020 · Updated 5 years ago
- A living document for all things Common Voice. ★14 · Jun 24, 2024 · Updated last year
- check your data, before you wreck your model ★16 · Aug 11, 2022 · Updated 3 years ago
- Coqui STT offline engine API for NodeJs developers. With a simple HTTP ASR server. ★30 · Jun 8, 2021 · Updated 4 years ago
- Linguistic processing for Common Voice ★58 · Jan 18, 2024 · Updated 2 years ago
- 🐸 collection of TTS papers ★723 · Jul 4, 2024 · Updated last year
- Example workflow for our data-centric speech benchmark ★17 · Jul 6, 2023 · Updated 2 years ago
- ★55 · Jan 13, 2023 · Updated 3 years ago
- C++ Implementation of the Information Bottleneck System ★22 · Jan 9, 2019 · Updated 7 years ago
- A list of accessible speech corpora for ASR, TTS, and other Speech Technologies ★1,386 · Jun 6, 2024 · Updated last year
- Python package for noise suppression in audio based on DNN ★22 · Mar 24, 2023 · Updated 2 years ago
- Adapt Kaldi-ASR nnet3 chain models from Zamia-Speech.org to a different language model ★33 · Jan 26, 2020 · Updated 6 years ago
- Resources for "Simple Speech Representation Learning from Perceptual Data". ★11 · Sep 18, 2023 · Updated 2 years ago
- A VR180 photo viewer that works on a web browser. ★11 · May 18, 2019 · Updated 6 years ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ★11 · Aug 7, 2023 · Updated 2 years ago
- Mozilla Voice Community Playbook ★48 · May 21, 2024 · Updated last year
- Grapheme to phoneme converter for Estonian ★14 · May 27, 2021 · Updated 4 years ago
- DEVKIT V1 projects, BLE, WiFi and Robotics. ★11 · Sep 30, 2025 · Updated 5 months ago
- A library of speech gadgets. ★14 · Oct 15, 2022 · Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech. ★24 · May 16, 2021 · Updated 4 years ago
- Ultrafast GAN-based Vocoder for Text to Speech ★50 · Jul 16, 2022 · Updated 3 years ago
- The YouTube Text-To-Speech dataset is comprised of waveform audio extracted from YouTube videos alongside their English transcriptions ★52 · Apr 1, 2021 · Updated 4 years ago
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR. ★15 · Mar 21, 2023 · Updated 2 years ago
- This is an extension of the Kaldi speech recognition software that allows decoding of speech with hybrid word and phoneme graphs.… ★11 · Feb 4, 2020 · Updated 6 years ago
- This repo contains the baseline model recipes and pre-trained model for the GramVanni Hindi ASR challenge ★15 · Mar 26, 2022 · Updated 3 years ago
- Recent audio generation papers (including speech, music and general audio) ★13 · Mar 14, 2023 · Updated 2 years ago
- Rendering engine for Blender 4.3 ★13 · Jan 19, 2025 · Updated last year
- PyTorch speech2text inference script for the NVidia openseq2seq wav2letter model variant ★10 · Aug 12, 2019 · Updated 6 years ago
- ★32 · Jan 6, 2022 · Updated 4 years ago