danklabs / tts_dataset_maker
A gui to help make a text to speech dataset.
☆18Updated last year
Related projects ⓘ
Alternatives and complementary repositories for tts_dataset_maker
- ☆64Updated 3 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 3 years ago
- ☆129Updated last year
- A python library to generate speech dataset from Youtube videos☆35Updated 5 months ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆50Updated 2 years ago
- Tools to create your own voice dataset for TTS training☆61Updated 4 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆228Updated 2 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 5 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆106Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆358Updated last year
- 🐸TTS recipes for different datasets☆84Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago
- Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source t…☆65Updated 2 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated 3 months ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆247Updated 2 years ago
- CSS10: A Collection of Single Speaker Speech Datasets for 10 Languages☆465Updated 4 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app☆28Updated last year
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Updated 4 years ago
- Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆208Updated 3 months ago
- ☆251Updated last year
- DLAS - A configuration-driven trainer for generative models☆137Updated 2 years ago
- Official repository of STYLER: Style Factor Modeling with Rapidity and Robustness via Speech Decomposition for Expressive and Controllabl…☆158Updated 2 years ago
- Implementation code of non-parallel sequence-to-sequence VC☆250Updated last year
- Audio Denoising with Deep Network Priors☆163Updated 4 years ago
- An unofficial implementation of the paper "AutoVC: Zero-Shot Voice Style Transfer with Only Autoencoder Loss".☆33Updated 3 years ago
- This is a phonemic multilingual (Russian-English) Implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-S…☆51Updated 4 years ago
- A pytroch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS☆229Updated 4 years ago
- Collect Voice Conversion researches☆90Updated this week