KathyReid / opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
★26, updated 2 years ago
Alternatives and similar repositories for opensource-voice-tools
Users who are interested in opensource-voice-tools are comparing it to the repositories listed below.
- (★76, updated 3 years ago)
- 🐸TTS recipes for different datasets (★86, updated 2 years ago)
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model (★107, updated 3 years ago)
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. (★102, updated 2 years ago)
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances (★50, updated 10 months ago)
- Dataset Release for Intent Classification from Speech (★47, updated 4 months ago)
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory (★16, updated 6 years ago)
- NPTEL2020: Speech2Text dataset for Indian-English Accent (★77, updated 3 years ago)
- Linguistic processing for Common Voice (★55, updated last year)
- Command line tool to create corpora for Common Voice (★77, updated last year)
- Forced Alignments for Common Voice (★31, updated 4 years ago)
- Coqui Inference Engine (★40, updated 3 years ago)
- Simple text to phonemes converter for multiple languages (★20, updated 2 years ago)
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning (★41, updated 2 years ago)
- (★56, updated 2 years ago)
- Rescoring methods for end-to-end Automatic Speech Recognition (★27, updated 4 years ago)
- Unicode Standard tokenization routines and orthography profile segmentation (★37, updated 4 months ago)
- Code for AccentDB. (★22, updated 4 years ago)
- Feature extractor for DL speech processing. (★66, updated 3 years ago)
- asr2k (★51, updated last year)
- Using YouTube to prepare a speech recognition dataset for any language (★10, updated 4 years ago)
- Jupyter Notebooks for creating Speech datasets (★46, updated 6 years ago)
- (★17, updated 4 years ago)
- A repository with comprehensive instructions for using the Festvox toolkit for generating Emotional speech from text (★48, updated 2 years ago)
- Datasets for turn-taking research (★14, updated last year)
- Mycroft's multilingual text parsing and formatting library (★76, updated last year)
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. (★27, updated last year)
- Simple Python library, distributed via binary wheels with few direct dependencies, for easily using wav2vec 2.0 models for speech recogni… (★24, updated 3 years ago)
- Add n-gram and large language model (LLM) support to Whisper models. (★29, updated 2 months ago)
- check your data, before you wreck your model (★16, updated 2 years ago)