KathyReid / opensource-voice-toolsLinks
A repo listing known open source voice tools, ordered by where they sit in the voice stack
☆26Updated 2 years ago
Alternatives and similar repositories for opensource-voice-tools
Users that are interested in opensource-voice-tools are comparing it to the libraries listed below
Sorting:
- ☆76Updated 3 years ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- Linguistic processing for Common Voice☆55Updated last year
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆40Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech.☆25Updated 4 years ago
- A simple voice conversion tool☆17Updated 3 years ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- automatically align transcribed audio and generate a wav2letter training corpus☆36Updated 2 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Unicode Standard tokenization routines and orthography profile segmentation☆37Updated 3 months ago
- Interface for Controllable Expressive Talking Machine☆38Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Coqui Inference Engine☆40Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆102Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated 2 years ago
- ☆36Updated last month
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆50Updated 8 months ago
- Interface for using TTS and vocoder models in the form of a text editor☆20Updated 2 years ago
- ☆14Updated 2 years ago
- 🐸STT integration examples☆128Updated 2 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 2 years ago
- ☆38Updated 3 years ago
- Rescoring methods for end-to-end Automatic Speech Recognition☆27Updated 4 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 10 months ago
- A set of tools for working with accent data in Mozilla's Common Voice dataset☆13Updated last year
- Add n-gram and large language model support to Whisper models.☆19Updated last month
- A collection of utilities for handling IPA phones.☆25Updated last year
- Command line tool to create corpora for Common Voice☆76Updated last year
- TurnGPT: a Transformer-based Language Model for Predicting Turn-taking in Spoken Dialog☆53Updated last year