silenterus / deepspeech-cleanerLinks
Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Framework
β47Updated 2 years ago
Alternatives and similar repositories for deepspeech-cleaner
Users that are interested in deepspeech-cleaner are comparing it to the libraries listed below
Sorting:
- Jupyter Notebooks for creating Speech datasetsβ46Updated 6 years ago
- πΈTTS recipes for different datasetsβ87Updated 2 years ago
- Speaker diarization python system based on binary key speaker modellingβ60Updated 3 years ago
- SailAlign is an open-source software toolkit for robust long speech-text alignment implementing an adaptive, iterative speech recognitionβ¦β98Updated 3 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Command line tool to create corpora for Common Voiceβ77Updated last year
- Multilingual Grapheme to Phonemeβ49Updated 9 years ago
- Helsinki Prosody Corpus and A System for Predicting Prosodic Prominence from Textβ242Updated 5 years ago
- Pytorch implementation of Deepmind's WaveRNN modelβ121Updated 5 years ago
- Code for Speaker Change Detection in Broadcast TV using Bidirectional Long Short-Term Memory Networksβ65Updated 4 years ago
- Incorporating KenLM language model with HuggingFace implementation of Wav2Vec2CTC Model using beam search decodingβ75Updated 3 years ago
- End-to-end spoken language identification out of the box.β48Updated 4 years ago
- Live demo for speech emotion recognition using Keras and Tensorflow modelsβ39Updated 10 months ago
- β80Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Modelβ107Updated 3 years ago
- Tool for creation, manipulation and maintenance of voice corporaβ81Updated last year
- 24-hour Automatic Speech Recognitionβ27Updated 4 years ago
- An HTML interface for finetuning the sync map output from aeneasβ53Updated 2 years ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabetβ¦β44Updated 4 years ago
- An easy way to fine-tune Wav2Vec 2.0 for low-resource languages.β82Updated 2 years ago
- Deep learning for Text to Speechβ27Updated 4 years ago
- Web app for keyword spotting using TensorflowJSβ72Updated 2 years ago
- A list of publically available audio data that anyone can download for ASR or other speech activitiesβ209Updated 3 years ago
- Use your data to create a speech recognition system in Kaldi. Fast.β65Updated 5 years ago
- β37Updated 2 months ago
- Tensorflow Implementation of Expressive Tacotronβ196Updated 6 years ago
- Simple Kaldi model server for chain (nnet3) models in online recognition mode directly from a local microphoneβ35Updated 3 years ago
- A Collection of Speech Corpus for ASR and TTSβ114Updated 8 years ago
- A collection of basic python modules for spoken natural language processingβ56Updated 5 years ago
- A repository for dictionaries to be used with the Prosodylab-Alignerβ17Updated 11 years ago