mozilla / deepspeech-playbook
A crash course for training speech recognition models using DeepSpeech.
☆24Updated 3 years ago
Alternatives and similar repositories for deepspeech-playbook:
Users that are interested in deepspeech-playbook are comparing it to the libraries listed below
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stack☆26Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Gentle and praatio scripts for easy forced alignment☆18Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Tool to analyze an audio corpora in terms of intonation, intensity, duration and voice quality☆21Updated 5 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Command line tool to create corpora for Common Voice☆75Updated 8 months ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆37Updated 2 years ago
- It is an algorithm analysed the acoustic features of a voice and creates an acoustic classifier - USEFUL for auto-speech-rater☆11Updated 5 years ago
- Linguistic processing for Common Voice☆53Updated last year
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- A collection of useful tools for handling speech recognition data☆30Updated 2 years ago
- A new metric for evaluating end-to-end speech recognition and disfluency removal systems☆19Updated 3 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- ☆74Updated 3 years ago
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- Labeled data for homograph disambiguation☆55Updated last year
- Breaks a word into syllables using an LSTM-based neural network.☆19Updated last year
- Unicode Standard tokenization routines and orthography profile segmentation☆34Updated 2 years ago
- C++ Implementation of the Information Bottleneck System☆23Updated 6 years ago
- A JAX library for building lattice-based speech transducer models☆42Updated last month
- Demo and samples for universal speech translator☆23Updated 2 years ago
- Parallelized automatic corpus collection for ASR. Forked from https://github.com/EgorLakomkin/KTSpeechCrawler☆23Updated 3 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track.☆13Updated last year
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphone☆36Updated 2 years ago