souvikg544 / TTS_Data_Maker
Text to speech is an emerging zone of AI. This repository helps to create a dataset with audio and transcripts for personalized text to speech .
☆27Updated 2 years ago
Alternatives and similar repositories for TTS_Data_Maker:
Users that are interested in TTS_Data_Maker are comparing it to the libraries listed below
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆15Updated last year
- ☆26Updated last year
- A TTS model that makes a speaker speak new languages☆76Updated 9 months ago
- CML-TTS: A Multilingual Dataset for Speech Synthesis☆30Updated 8 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆95Updated 5 months ago
- ☆56Updated 2 years ago
- Implementation of Emo-StarGAN☆45Updated last year
- A simple voice conversion tool☆17Updated 3 years ago
- Finetuning VITS Efficiently☆32Updated last year
- 'Grad-TTS' with Multilingual Cleaners☆10Updated 11 months ago
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- Finally, some decent sample sentences☆22Updated last year
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- Audio tokenization, in the fastest way possible!☆49Updated 7 months ago
- This app is intended to automatically create a corpus for ASR systems using pseudo-labeling.☆27Updated last year
- pytorch implementation for MultiSpeech: Multi-Speaker Text to Speech with Transformer paper☆22Updated 2 years ago
- Unofficial implementation of wavenext vocoder☆44Updated 7 months ago
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆85Updated last year
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆120Updated 2 years ago
- Demo for 2022 ICASSP☆64Updated 2 years ago
- Convert English text from written expressions into spoken forms☆24Updated 2 years ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 9 months ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- BigVGAN with Neural Source-Filter☆53Updated last year
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 9 months ago