souvikg544 / TTS_Data_Maker
Text-to-speech is an emerging area of AI. This repository helps create a dataset of audio and transcripts for personalized text-to-speech.
☆27 Updated last year
Related projects
Alternatives and complementary repositories for TTS_Data_Maker
- A TTS model that makes a speaker speak new languages ☆75 Updated 5 months ago
- ☆19 Updated last year
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling. ☆27 Updated 9 months ago
- ☆32 Updated 2 months ago
- This is an implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS. ☆61 Updated last week
- Convert English text from written expressions into spoken forms ☆21 Updated 2 years ago
- A lightweight voice conversion ☆78 Updated 2 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis ☆29 Updated 3 months ago
- Create data for training a voice cloner for Bark text-to-speech. ☆44 Updated last year
- Community framework for training tortoise ☆38 Updated 2 years ago
- The YouTube Text-To-Speech dataset comprises waveform audio extracted from YouTube videos alongside their English transcriptions ☆50 Updated 3 years ago
- 56 languages, 1 model: multilingual ASR ☆24 Updated 3 years ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS ☆64 Updated last year
- ☆56 Updated last year
- Finally, some decent sample sentences ☆22 Updated 11 months ago
- Finetuning VITS Efficiently ☆32 Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆83 Updated last month
- ☆33 Updated last year
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition ☆15 Updated 8 months ago
- Unsupervised Rhythm Modeling for Voice Conversion ☆80 Updated last year
- Official Code for ParrotTTS ☆42 Updated last month
- Collection of scripts from mHuBERT-147. ☆22 Updated this week
- PyTorch implementation of the paper MultiSpeech: Multi-Speaker Text to Speech with Transformer ☆19 Updated 2 years ago
- Audio tokenization, in the fastest way possible! ☆45 Updated 2 months ago
- 4G GPU & 10 minutes for training ☆12 Updated last year
- Zero-Shot Emotion Style Transfer ☆37 Updated 7 months ago
- ☆62 Updated 6 months ago
- Supervoice diffusion enhance ☆24 Updated 4 months ago
- Demo for 2022 ICASSP ☆64 Updated 2 years ago
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l… ☆22 Updated 3 months ago