danklabs / tts_dataset_makerLinks
A gui to help make a text to speech dataset.
☆18Updated 3 years ago
Alternatives and similar repositories for tts_dataset_maker
Users that are interested in tts_dataset_maker are comparing it to the libraries listed below
Sorting:
- ☆63Updated 4 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆360Updated 2 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆229Updated 3 years ago
- Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu☆253Updated 4 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 4 years ago
- 🐸TTS recipes for different datasets☆86Updated 3 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆321Updated last year
- This repository has implementation for "Neural Voice Cloning With Few Samples"☆436Updated 4 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 6 years ago
- ☆130Updated 2 years ago
- Performant and accurate speech recognition built on Pytorch☆254Updated 3 years ago
- WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆256Updated 6 years ago
- Deepstory turns a text/generated text into a video where the character is animated to speak your story using his/her voice.☆101Updated last month
- Jupyter Notebooks for creating Speech datasets☆46Updated 6 years ago
- Text to Speech with PyTorch (English and Mongolian)☆187Updated last year
- Keras implementations of Tacotron-2☆27Updated 4 years ago
- Tools to create your own voice dataset for TTS training☆70Updated 5 years ago
- DLAS - A configuration-driven trainer for generative models☆141Updated 3 years ago
- A python library to generate speech dataset from Youtube videos☆36Updated last year
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆171Updated 5 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆266Updated 3 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆89Updated 4 years ago
- Tensorflow Implementation of Expressive Tacotron☆196Updated 7 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 4 years ago
- Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain"☆210Updated last year
- Conditional lyrics generator -> pre-trained GPT2 model fine-tuned on lyrics with features dataset.☆42Updated 5 years ago
- Pytorch implementation of Deepmind's WaveRNN model☆123Updated 6 years ago
- ☆14Updated 7 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 3 years ago
- Audio style transfer with shallow random parameters CNN.☆406Updated 10 months ago