KathyReid / opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
⭐26 Updated 2 years ago
Alternatives and similar repositories for opensource-voice-tools:
Users who are interested in opensource-voice-tools are comparing it to the libraries listed below.
- 🐸TTS recipes for different datasets ⭐87 Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ⭐25 Updated 2 years ago
- A crash course for training speech recognition models using DeepSpeech. ⭐25 Updated 3 years ago
- Simple text to phonemes converter for multiple languages ⭐20 Updated 2 years ago
- ⭐75 Updated 3 years ago
- TTS Client for Coqui TTS server ⭐13 Updated 2 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances ⭐49 Updated 7 months ago
- Speech to text library for Rhasspy using Kaldi ⭐14 Updated last year
- Forced Alignments for Common Voice ⭐31 Updated 4 years ago
- 🚫 check your data, before you wreck your model ⭐16 Updated 2 years ago
- Evaluation of STT models for the German language ⭐15 Updated 3 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing. ⭐102 Updated 2 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning ⭐39 Updated 2 years ago
- A collection of utilities for handling IPA phones. ⭐25 Updated last year
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ⭐107 Updated 3 years ago
- Interface for Controllable Expressive Talking Machine ⭐38 Updated last year
- automatically align transcribed audio and generate a wav2letter training corpus ⭐36 Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ⭐23 Updated 9 months ago
- The EveryVoice TTS Toolkit - Text To Speech for your language ⭐30 Updated this week
- ⭐14 Updated 2 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory ⭐16 Updated 6 years ago
- Using YouTube to prepare a speech recognition dataset for any language ⭐10 Updated 4 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp… ⭐12 Updated last year
- Unicode Standard tokenization routines and orthography profile segmentation ⭐37 Updated 2 months ago
- Gentle and praatio scripts for easy forced alignment ⭐18 Updated 2 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to… ⭐32 Updated 4 years ago
- ⭐11 Updated 3 years ago
- ⭐17 Updated 4 years ago
- Coqui's machine learning job scheduler ⭐32 Updated 3 years ago
- Prosodic Speech Segmentation with Transformers ⭐25 Updated last year