KathyReid / opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
☆26Updated 2 years ago
Alternatives and similar repositories for opensource-voice-tools:
Users that are interested in opensource-voice-tools are comparing it to the libraries listed below
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- ☆74Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Evaluation of STT models for german language☆15Updated 3 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆49Updated 6 months ago
- Code for AccentDB.☆20Updated 3 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Kaldi style neural network training in pytorch for use in place of nnet3 in Kaldi.☆26Updated 7 months ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆101Updated last year
- asr2k☆49Updated 9 months ago
- Linguistic processing for Common Voice☆53Updated last year
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Explore different way to mix speech model(wav2vec2, hubert) and nlp model(BART,T5,GPT) together☆45Updated last year
- phone inventory library☆16Updated last year
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- ☆10Updated this week
- 24-hour Automatic Speech Recognition☆27Updated 3 years ago
- A simple voice conversion tool☆17Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated last year
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆15Updated 4 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆37Updated 2 years ago
- This repository is for wake-word detection in speech using recurrent neural networks☆17Updated 6 years ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- Grapheme To Phoneme☆70Updated 7 months ago
- Dataset Release for Intent Classification from Speech☆46Updated 3 weeks ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago