mozilla / deepspeech-playbook
A crash course for training speech recognition models using DeepSpeech.
☆24 · Updated 3 years ago
Alternatives and similar repositories for deepspeech-playbook:
Users who are interested in deepspeech-playbook are comparing it to the libraries listed below.
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆26 · Updated 2 years ago
- 🐸TTS recipes for different datasets ☆85 · Updated 2 years ago
- Forced Alignments for Common Voice ☆31 · Updated 4 years ago
- Speech to text library for Rhasspy using Kaldi ☆14 · Updated last year
- check your data, before you wreck your model ☆16 · Updated 2 years ago
- ☆74 · Updated 3 years ago
- Tool to analyze audio corpora in terms of intonation, intensity, duration and voice quality ☆21 · Updated 5 years ago
- 24-hour Automatic Speech Recognition ☆27 · Updated 3 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 · Updated last year
- TTS Client for Coqui TTS server ☆13 · Updated 2 years ago
- Linguistic processing for Common Voice ☆53 · Updated last year
- A JAX library for building lattice-based speech transducer models ☆43 · Updated 3 months ago
- Proposed splits for the LREC Wikipron paper ☆14 · Updated 4 years ago
- Command line tool to create corpora for Common Voice ☆75 · Updated 9 months ago
- ☆56 · Updated 2 years ago
- docker for HF wav2vec2-sprint ☆13 · Updated 3 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi. ☆26 · Updated 7 months ago
- Coqui's machine learning job scheduler ☆32 · Updated 3 years ago
- Grapheme To Phoneme ☆70 · Updated 7 months ago
- Coqui Inference Engine ☆38 · Updated 3 years ago
- A "Crowd-Built" continuously growing speech dataset with transcripts. The dataset contains multiple languages and is intended for anyone … ☆41 · Updated 2 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library. ☆15 · Updated 4 years ago
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… ☆24 · Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ☆101 · Updated last year
- ☆11 · Updated 3 years ago
- Labeled data for homograph disambiguation ☆56 · Updated last year
- An online speech recognition extension toolkit of Kaldi ☆56 · Updated 3 years ago
- ☆33 · Updated 9 months ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabet… ☆43 · Updated 4 years ago
- Pytorch implementation of Deepmind's WaveRNN model ☆121 · Updated 5 years ago