KathyReid / opensource-voice-tools
A repo listing known open source voice tools, ordered by where they sit in the voice stack
☆25Updated last year
Related projects: ⓘ
- Simple text to phonemes converter for multiple languages☆21Updated last year
- A crash course for training speech recognition models using DeepSpeech.☆23Updated 3 years ago
- 🫠 check your data, before you wreck your model☆16Updated 2 years ago
- Forced Alignments for Common Voice☆29Updated 3 years ago
- Using YouTube to prepare a speech recognition dataset for any language☆10Updated 3 years ago
- Coqui Inference Engine☆38Updated 3 years ago
- Linguistic processing for Common Voice☆50Updated 8 months ago
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo☆24Updated last year
- MaSS - Multilingual corpus of Sentence-aligned Spoken utterances☆48Updated this week
- Interface for using TTS and vocoder models in the form of a text editor☆19Updated last year
- 🐍 Coqui's machine learning job scheduler☆31Updated 3 years ago
- Evaluation of STT models for german language☆15Updated 2 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year
- Command line tool to create corpora for Common Voice☆75Updated 3 months ago
- Unicode Standard tokenization routines and orthography profile segmentation☆31Updated 2 years ago
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!☆23Updated last year
- Analysis and investigating the confounding effect of accents in end-to-end Automatic Speech Recognition models.☆13Updated 4 years ago
- 🦁 Nala is an agile open-source voice assistant framework (20+ actions).☆35Updated last year
- TTS Client for Coqui TTS server☆13Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratory☆16Updated 5 years ago
- ☆11Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆106Updated 3 years ago
- Text to speech plugin for Mycroft using Mimic 3☆7Updated 2 years ago
- Zero-shot Audio Classification using Whisper☆74Updated last year
- Python module to clean and transliterate (i.e. normalize) German text including abbreviations, numbers, timestamps etc. It can be used to…☆29Updated 3 years ago
- A collection of utilities for handling IPA phones.☆22Updated 11 months ago
- ☆17Updated last year
- automatically align transcribed audio and generate a wav2letter training corpus☆34Updated last year
- C++ Implementation of the Information Bottleneck System☆23Updated 5 years ago