coqui-ai / stt-model-managerLinks
Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo
β25Updated 2 years ago
Alternatives and similar repositories for stt-model-manager
Users that are interested in stt-model-manager are comparing it to the libraries listed below
Sorting:
- TTS Client for Coqui TTS serverβ13Updated 2 years ago
- πΈTTS recipes for different datasetsβ86Updated 3 years ago
- Interface for using TTS and vocoder models in the form of a text editorβ19Updated 3 years ago
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with theβ¦β47Updated 2 years ago
- A deep-learning powered accessibility application which turns pdfs into audio files. Featuring ocr improvement and tts with inflection!β25Updated 8 months ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented textβ34Updated 5 years ago
- My guide to create an italian TTS with Coquiβ14Updated 3 years ago
- Zero-shot multimodal punctuation insertion and truecasing using Whisperβ119Updated 2 years ago
- A simple voice conversion toolβ19Updated 3 years ago
- Putting flows on top of neural transducers for better TTSβ64Updated this week
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.β137Updated 2 years ago
- The EveryVoice TTS Toolkit - Text To Speech for your languageβ41Updated this week
- IPA Phonemizer/Dephonemizer for 139 human languagesβ44Updated last month
- A free & open tool for transcribing audio interviews with offline ASR supportβ25Updated last year
- Docker images for Coqui AIβ60Updated 4 years ago
- A high-quality, varied ~30hr voice dataset suitable for training a TTS modelβ63Updated 2 years ago
- β11Updated 2 months ago
- Convert Arpabet to IPA. Arpabet is the set of phonemes used by the CMU Pronouncing Dictionary. IPA is the International Phonetic Alphabetβ¦β44Updated 5 years ago
- A repo listing known open source voice tools, ordered by where they sit in the voice stackβ27Updated 3 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.β35Updated 2 years ago
- Simple text to phonemes converter for multiple languagesβ20Updated 2 years ago
- Multi-Language Dataset Cleaner/Creator for Mozilla's DeepSpeech Frameworkβ48Updated 2 years ago
- Desktop application for neural speech synthesis written in C++β213Updated 2 years ago
- A curated list of awesome voice activity detectionβ68Updated 11 months ago
- πΈSTT integration examplesβ129Updated 3 years ago
- πΌ Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decompositionβ15Updated last year
- π« check your data, before you wreck your modelβ16Updated 3 years ago
- Coqui AI TTS pluginβ87Updated 4 months ago
- Tools to create your own voice dataset for TTS trainingβ68Updated 5 years ago
- A crash course for training speech recognition models using DeepSpeech.β25Updated 4 years ago