KathyReid / opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
☆25Updated 2 years ago
Alternatives and similar repositories for opensource-voice-tools:
Users that are interested in opensource-voice-tools are comparing it to the libraries listed below
- ☆74Updated 3 years ago
- A crash course for training speech recognition models using DeepSpeech.☆24Updated 3 years ago
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆49Updated 4 months ago
- Command line tool to create corpora for Common Voice☆75Updated 7 months ago
- Forced Alignments for Common Voice☆31Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Linguistic processing for Common Voice☆52Updated last year
- Simple text to phonemes converter for multiple languages☆20Updated 2 years ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- TTS Client for Coqui TTS server☆13Updated 2 years ago
- Speech to text library for Rhasspy using Kaldi☆14Updated last year
- SylNet: An Adaptable End-to-End Syllable Count Estimator for Speech☆25Updated last year
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆23Updated 2 years ago
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 5 years ago
- Spoken Language Identification on Common Voice and AudioSet using Deep Learning☆37Updated 2 years ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆25Updated last year
- Unicode Standard tokenization routines and orthography profile segmentation☆34Updated 2 years ago
- Code for AccentDB.☆19Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Mycroft's multilingual text parsing and formatting library☆75Updated last year
- Evaluation of STT models for german language☆15Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- ☆32Updated 3 years ago
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- This repository contains the implementation of the paper: "Span Classification with Structured Information for Disfluency Detection in Sp…☆12Updated last year
- Grapheme to phoneme model for PyTorch☆40Updated 2 years ago
- Emotive Speech generation based on DAVID: An open-source platform for real-time emotional speech transformation using pysox☆13Updated 6 years ago
- Identifying individual speakers in an audio stream based on the unique characteristics found in individual voices using Python☆18Updated last year
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated 2 years ago
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆31Updated 4 years ago