thorstenMueller / cTTS
TTS Client for Coqui TTS server
☆13 · Updated 2 years ago
Alternatives and similar repositories for cTTS:
Users who are interested in cTTS are comparing it to the libraries listed below.
- Coqui STT Model Manager - install, manage and try out Coqui STT models from the Model Zoo ☆25 · Updated last year
- Interface for using TTS and vocoder models in the form of a text editor ☆19 · Updated 2 years ago
- check your data, before you wreck your model ☆16 · Updated 2 years ago
- A free & open tool for transcribing audio interviews with offline ASR support ☆24 · Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using… ☆28 · Updated last year
- This will hold the crowdsourcing platform to be used to store voice data from various speakers which will act as input dataset for speech… ☆17 · Updated last year
- A deep-learning powered accessibility application which turns PDFs into audio files. Featuring OCR improvement and TTS with inflection! ☆23 · Updated this week
- Simple PyTorch Denoisers for Waveform Audio ☆34 · Updated 2 months ago
- Simple text to phonemes converter for multiple languages ☆20 · Updated 2 years ago
- StyleTTS 2 Optimized Training Fork ☆22 · Updated 2 weeks ago
- Coqui Inference Engine ☆38 · Updated 3 years ago
- A very basic demonstration connecting speech recognition and text-to-speech ☆19 · Updated 4 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆22 · Updated 6 months ago
- Audio samples accompanying publications related to DF-Conformer, a speech enhancement model. ☆20 · Updated last year
- Using YouTube to prepare a speech recognition dataset for any language ☆10 · Updated 3 years ago
- Coqui's machine learning job scheduler ☆32 · Updated 3 years ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger. ☆35 · Updated last year
- A crash course for training speech recognition models using DeepSpeech. ☆24 · Updated 3 years ago
- Stable timestamps and confidence score for words of OpenAI's Whisper outputs down to word-level. ☆25 · Updated 2 years ago
- ☆8 · Updated last year
- An even smaller speech recognizer / force aligner ☆32 · Updated 2 months ago
- My guide to creating an Italian TTS with Coqui ☆14 · Updated 3 years ago
- A bidirectional recurrent neural network model with attention mechanism for restoring missing punctuation in unsegmented text ☆36 · Updated 4 years ago
- pyannote + notebook = pyannotebook ☆26 · Updated last year
- A repo listing known open source voice tools, ordered by where they sit in the voice stack ☆26 · Updated 2 years ago
- Streamlit app to visualize and edit TTS datasets ☆14 · Updated 3 years ago
- ☆11 · Updated 9 years ago
- Code for the winning solution in the SE&R 2022 Challenge - SER track. ☆13 · Updated last year
- This repository contains all the code necessary for running the multilingual distilwhisper from Ferraz et al. 2024 IEEE ICASSP paper. ☆20 · Updated 11 months ago