mozilla / deepspeech-playbookLinks
A crash course for training speech recognition models using DeepSpeech.
β24Updated 4 years ago
Alternatives and similar repositories for deepspeech-playbook
Users that are interested in deepspeech-playbook are comparing it to the libraries listed below
Sorting:
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learningβ40Updated 3 years ago
- β76Updated 4 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ27Updated 3 years ago
- Trained models for automatic speech recognition (ASR). A library to quickly build applications that require speech to text conversion.β130Updated 4 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.β65Updated 6 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β106Updated 2 years ago
- Advanced data structures for handling temporal segments with attached labels.β124Updated 4 months ago
- Forced Alignments for Common Voiceβ32Updated 5 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 4 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- automatically align transcribed audio and generate a wav2letter training corpusβ36Updated 2 years ago
- End to End Dialect Identification using Convolutional Neural Networkβ53Updated 6 years ago
- Phoneme prediction from speech mel-spectrograms using RNN.β15Updated 6 years ago
- A collection of basic python modules for spoken natural language processingβ55Updated 6 years ago
- Automatic Speech Recognition Dataset Generationβ37Updated 7 years ago
- A recipe for creating a Speaker Identification system built on Kaldi.β15Updated 6 years ago
- πΈSTT integration examplesβ130Updated 3 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.β15Updated 5 years ago
- Pytorch implementation of Deepmind's WaveRNN modelβ123Updated 6 years ago
- Server & client for DeepSpeech using WebSockets for real-time speech recognition in separate environments