coqui-ai / stt-model-managerLinks
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
β26Updated 2 years ago
Alternatives and similar repositories for stt-model-manager
Users that are interested in stt-model-manager are comparing it to the libraries listed below
Sorting:
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 2 years ago
- Interface for using TTS and vocoder models in the form of a text editorβ19Updated 2 years ago
- Streamlit app to visualize and edit TTS datasetsβ14Updated 3 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a lβ¦β23Updated 11 months ago
- SEPIA server to support open-source speech recognition via WebSocket connection.β128Updated 8 months ago
- A free & open tool for transcribing audio interviews with offline ASR supportβ24Updated last year
- π« check your data, before you wreck your modelβ16Updated 2 years ago
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- Simple PyTorch Denoisers for Waveform Audioβ35Updated 2 months ago
- Tools to create your own voice dataset for TTS trainingβ67Updated 4 years ago
- OCTRA is a web-application for the orthographic transcription of audio files.β39Updated this week
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.β35Updated 2 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Frameworkβ47Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your languageβ37Updated this week
- OpenAI Whisper Prompt Examplesβ52Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with theβ¦β47Updated 2 years ago
- Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to sβ¦β28Updated 2 years ago
- Labeled data for homograph disambiguationβ59Updated 2 years ago
- π Coqui's machine learning job schedulerβ32Updated 3 years ago
- Text prompt steered synthetic audio generatorsβ47Updated 3 months ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ116Updated 2 years ago
- Coqui AI TTS pluginβ80Updated last week
- IPA Phonemizer/Dephonemizer for 139 human languagesβ30Updated this week
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level.β25Updated 2 years ago
- Aiming to achieve ultimate Multilingual TTS pipeline with main focus on releasing COQUIπΈTTS(Text-to-Speech) based high performing neuralβ¦β41Updated last year
- A high-quality, varied ~30hr voice dataset suitable for training a TTS modelβ61Updated 2 years ago
- Tunable pipelinesβ34Updated 4 months ago
- Heteronym to Phoneme Parserβ18Updated last year
- Lyra V2 (SoundStream) running in the browserβ19Updated last year