danklabs / tts_dataset_maker
A GUI to help make a text-to-speech dataset.
☆18Updated 2 years ago
Alternatives and similar repositories for tts_dataset_maker:
Users who are interested in tts_dataset_maker are comparing it to the libraries listed below.
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆128Updated 4 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆228Updated 3 years ago
- ☆130Updated 2 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆53Updated 3 years ago
- ☆64Updated 4 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆360Updated 2 years ago
- A pytorch implementation of StarGAN-VC2☆147Updated 4 years ago
- Tools to create your own voice dataset for TTS training☆66Updated 4 years ago
- 🐸TTS recipes for different datasets☆87Updated 2 years ago
- Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆211Updated 9 months ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆319Updated 9 months ago
- WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆251Updated 5 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- Text to Speech with PyTorch (English and Mongolian)☆185Updated 7 months ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- MelGAN vocoder (compatible with NVIDIA/tacotron2)☆644Updated 4 years ago
- A tensorflow implementation of the "Style Tokens: Unsupervised Style Modeling, Control and Transfer in End-to-End Speech Synthesis"☆367Updated 6 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆239Updated 4 years ago
- DLAS - A configuration-driven trainer for generative models☆139Updated 2 years ago
- A PyTorch implementation of the GAN-TTS: HIGH FIDELITY SPEECH SYNTHESIS WITH ADVERSARIAL NETWORKS☆231Updated 5 years ago
- Conditional lyrics generator -> pre-trained GPT2 model fine-tuned on lyrics with features dataset.☆40Updated 5 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆121Updated 5 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆171Updated 9 months ago
- Official implementation of VQMIVC: One-shot (any-to-any) Voice Conversion @ Interspeech 2021 + Online playing demo!☆350Updated 3 years ago
- Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source t…☆68Updated 3 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago
- Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu☆254Updated 4 years ago
- Audio style transfer with shallow random parameters CNN.☆404Updated 2 months ago