silenterus / deepspeech-cleaner
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
β47Updated last year
Alternatives and similar repositories for deepspeech-cleaner:
Users that are interested in deepspeech-cleaner are comparing it to the libraries listed below
- Jupyter Notebooks for creating Speech datasetsβ46Updated 6 years ago
- πΈTTS recipes for different datasetsβ86Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zooβ25Updated 2 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 6 years ago
- Tool for creation, manipulation and maintenance of voice corporaβ81Updated 10 months ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 3 years ago
- An HTML interface for finetuning the sync map output from aeneasβ53Updated 2 years ago
- Interface for Controllable Expressive Talking Machineβ38Updated last year
- This is a legacy repo. Dev occurs now on GitHub.β11Updated 4 years ago
- A python library to generate speech dataset from Youtube videosβ36Updated 9 months ago
- Deep Convolution Text to Speechβ35Updated 7 years ago
- Tools to create your own voice dataset for TTS trainingβ66Updated 4 years ago
- Artie Bias Corpus: an audio corpus + code for detecting demographic biasβ21Updated 4 years ago
- A set of scripts to use in preparing a corpus for speech-to-text processing with the Kaldi Automatic Speech Recognition Library.β15Updated 4 years ago
- Wave2vec 2.0 Recognize pipelineβ33Updated 4 years ago
- Compendium for the paper "Transparent pronunciation scoring using articulatorily weighted phoneme edit distance" by Karhila, Smolander, Yβ¦β25Updated 5 years ago
- Multilingual Grapheme to Phonemeβ49Updated 9 years ago
- A TensorFlow Implementation of Punctuation Restoration.β18Updated 4 years ago
- 24-hour Automatic Speech Recognitionβ27Updated 3 years ago
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly esβ¦β19Updated 3 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglowβ128Updated 3 years ago
- Command line tool to create corpora for Common Voiceβ75Updated 10 months ago
- Pytorch implementation of Deepmind's WaveRNN modelβ121Updated 5 years ago
- β17Updated last year
- speaker diarization system using an LSTMβ50Updated 2 years ago
- ASRecognition: just an easy-to-use library for Automatic Speech Recognition.β51Updated 2 years ago
- β75Updated 3 years ago
- Toolbox for easy and qualitative one-shot voice conversionβ45Updated 3 years ago
- β17Updated 3 years ago