souvikg544 / TTS_Data_MakerLinks
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to speech .
☆28Updated 2 years ago
Alternatives and similar repositories for TTS_Data_Maker
Users that are interested in TTS_Data_Maker are comparing it to the libraries listed below
Sorting:
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- KATube is a tool to automate the process of creating datasets for training Text-To-Speech (TTS) and Speech-To-Text (STT) models. From a l…☆23Updated 10 months ago
- A simple voice conversion tool☆17Updated 3 years ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆19Updated 2 weeks ago
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆16Updated last year
- ☆14Updated last year
- 'Grad-TTS' with Multilingual Cleaners☆10Updated last year
- Misc. tools/scripts that I made to use for tortoise☆21Updated 9 months ago
- Create training data for training a voice cloner for bark text to speech.☆45Updated last year
- ☆25Updated last year
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆28Updated 2 years ago
- RVC Onnx Infer- Upgraded and simplified-ish☆21Updated last year
- The Vokan Architecture (Tsukasa speech based)☆9Updated 3 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆29Updated 4 months ago
- create dataset from list of youtube links easily☆18Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆31Updated 10 months ago
- Convert English text from written expressions into spoken forms☆25Updated 2 years ago
- 4G GPU & 10 Minutes for train☆12Updated last year
- ☆15Updated 3 months ago
- A python library to generate speech dataset from Youtube videos☆36Updated last year
- 56 language, 1 model Multilingual ASR☆25Updated 3 years ago
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆21Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition.☆15Updated 3 weeks ago
- SpeechGLUE is a speech version of the GLUE benchmark, driven by text-to-speech.☆13Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆25Updated 6 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆98Updated 7 months ago
- My vocoder experiments☆29Updated 7 months ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆29Updated last year