coqui-ai / stt-model-manager
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
β24Updated last year
Related projects β
Alternatives and complementary repositories for stt-model-manager
- TTS Client for Coqui TTS serverβ13Updated last year
- π« check your data, before you wreck your modelβ16Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editorβ19Updated 2 years ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- A free & open tool for transcribing audio interviews with offline ASR supportβ24Updated 11 months ago
- Streamlit app to visualize and edit TTS datasetsβ14Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a lβ¦β22Updated 3 months ago
- A crash course for training speech recognition models using DeepSpeech.β24Updated 3 years ago
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- πΈTTS recipes for different datasetsβ84Updated 2 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ25Updated 2 years ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription usingβ¦β28Updated last year
- The EveryVoice TTS Toolkit - Text To Speech for your languageβ21Updated this week
- A high-quality, varied ~30hr voice dataset suitable for training a TTS modelβ55Updated last year
- Coqui AI TTS pluginβ69Updated 2 months ago
- 24-hour Automatic Speech Recognitionβ27Updated 3 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.β35Updated this week
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speechβ¦β17Updated last year
- Running Mozilla's implementation of Baidu DeepSpeech on Google Colaboratoryβ16Updated 5 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.β32Updated last year
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.β25Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with theβ¦β46Updated last year
- Tools to create your own voice dataset for TTS trainingβ61Updated 4 years ago
- A minimalistic automatic speech recognition streamlit based webapp powered by OpenAI's Whisper "State of the Art" modelsβ65Updated 2 years ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!β23Updated last year
- Tunable pipelinesβ30Updated last month
- Implementation of Google's USM speech model in Pytorchβ25Updated last week
- Evaluation of STT models for german languageβ15Updated 2 years ago
- Forced Alignments for Common Voiceβ31Updated 4 years ago
- Public voice datasets used for our Text-to-Speech voices.β30Updated 3 months ago