danklabs / tts_dataset_maker
A GUI to help make a text-to-speech dataset.
☆18 · Updated 2 years ago
Alternatives and similar repositories for tts_dataset_maker
Users who are interested in tts_dataset_maker are comparing it to the repositories listed below.
- ☆63 · Updated 4 years ago
- 🐸TTS recipes for different datasets ☆86 · Updated 3 years ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction. ☆89 · Updated 4 years ago
- Multispeaker & Emotional TTS based on Tacotron 2 and WaveGlow ☆129 · Updated 4 years ago
- Performant and accurate speech recognition built on PyTorch ☆253 · Updated 3 years ago
- DLAS - A configuration-driven trainer for generative models ☆139 · Updated 2 years ago
- Text to Speech with PyTorch (English and Mongolian) ☆185 · Updated 11 months ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p… ☆52 · Updated 3 years ago
- ☆130 · Updated 2 years ago
- Implementation of Neural Voice Cloning with Few Samples Research Paper by Baidu ☆254 · Updated 4 years ago
- WIP: Open Source Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain" ☆256 · Updated 6 years ago
- A Python library to generate speech dataset from YouTube videos ☆36 · Updated last year
- This repository has implementation for "Neural Voice Cloning With Few Samples" ☆436 · Updated 4 years ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms ☆229 · Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning. ☆360 · Updated 2 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp… ☆57 · Updated 6 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model ☆107 · Updated 4 years ago
- Jupyter Notebooks for creating Speech datasets ☆46 · Updated 6 years ago
- VCTK multi-speaker tacotron for ICASSP 2020 ☆265 · Updated 3 years ago
- Implementation of "MelNet: A Generative Model for Audio in the Frequency Domain" ☆211 · Updated last year
- Ensemble of Neural Tools for Animations Restoration ☆65 · Updated 2 years ago
- An HTML interface for finetuning the sync map output from aeneas ☆53 · Updated 3 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network ☆320 · Updated last year
- Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source t… ☆68 · Updated 3 years ago
- DeepSpeech based forced alignment tool ☆239 · Updated 4 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit… ☆170 · Updated 4 years ago
- PyTorch implementation of DeepMind's WaveRNN model ☆121 · Updated 6 years ago
- Walk through insanely commented code for an advanced recurrent model in TensorFlow ☆48 · Updated 7 years ago
- flask+tornado based NVIDIA tacotron2+waveglow tts web app ☆29 · Updated 2 years ago
- ⏩ Generating speech in a single forward pass without any attention! ☆579 · Updated last year