mozilla / deepspeech-playbook
A crash course for training speech recognition models using DeepSpeech.
β25Updated 3 years ago
Alternatives and similar repositories for deepspeech-playbook:
Users that are interested in deepspeech-playbook are comparing it to the libraries listed below
- πΈTTS recipes for different datasetsβ86Updated 2 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 3 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ26Updated 2 years ago
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice qualityβ21Updated 5 years ago
- Command line tool to create corpora for Common Voiceβ75Updated 10 months ago
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- Scripts to simplify data prepping for Mozilla DeepSpeech.β14Updated 5 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- β56Updated 2 years ago
- Grapheme To Phonemeβ71Updated 8 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.β102Updated 2 years ago
- Speech to text library for Rhasspy using Kaldiβ14Updated last year
- A collection of useful tools for handling speech recognition dataβ30Updated 2 years ago
- Some simple wrappers around eSpeak NG intended to make using this excellent TTS for waveform and IPA generation as convenient as possibleβ¦β41Updated 6 months ago
- C++ Implementation of the Information Bottleneck Systemβ23Updated 6 years ago
- This will hold the data pipeline to convert raw audio data to speech which will act as input dataset for speech-to-text pipelineβ32Updated 2 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterancesβ49Updated 7 months ago
- A collection of basic python modules for spoken natural language processingβ56Updated 5 years ago
- Speaker diarization and speech to textβ15Updated 4 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated 2 years ago
- Linguistic processing for Common Voiceβ55Updated last year
- A recipe for creating a Speaker Identification system built on Kaldi.β15Updated 5 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learningβ38Updated 2 years ago
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated 2 years ago
- An online speech recognition extension toolkit of Kaldiβ56Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.β14Updated 2 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- π« check your data, before you wreck your modelβ16Updated 2 years ago
- Support tools for punctuation and boundary detection for ASR output.β57Updated 2 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.β65Updated 5 years ago