danklabs / tts_dataset_maker
A gui to help make a text to speech dataset.
☆18Updated 2 years ago
Alternatives and similar repositories for tts_dataset_maker:
Users that are interested in tts_dataset_maker are comparing it to the libraries listed below
- Multispeaker & Emotional TTS based on Tacotron 2 and Waveglow☆129Updated 3 years ago
- VocGAN: A High-Fidelity Real-time Vocoder with a Hierarchically-nested Adversarial Network☆318Updated 5 months ago
- Tools to create your own voice dataset for TTS training☆64Updated 4 years ago
- Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to p…☆52Updated 2 years ago
- ☆129Updated last year
- ☆64Updated 3 years ago
- Tacotron 2 - PyTorch implementation with faster-than-realtime inference modified to enable cross lingual voice cloning.☆358Updated last year
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- An HTML interface for finetuning the sync map output from aeneas☆53Updated 2 years ago
- A python library to generate speech dataset from Youtube videos☆36Updated 7 months ago
- MelGAN-VC: Voice Conversion and Audio Style Transfer on arbitrarily long samples using Spectrograms☆228Updated 2 years ago
- Jupyter Notebooks for creating Speech datasets☆46Updated 5 years ago
- Voice Conversion Challenge 2020 CycleVAE baseline system☆133Updated 4 years ago
- VCTK multi-speaker tacotron for ICASSP 2020☆265Updated 2 years ago
- SC-GlowTTS: an Efficient Zero-Shot Multi-Speaker Text-To-Speech Model☆107Updated 3 years ago
- A pytorch implementation of StarGAN-VC2☆147Updated 4 years ago
- 🐸TTS recipes for different datasets☆85Updated 2 years ago
- Deep Convolution Text to Speech☆35Updated 6 years ago
- Desktop application for neural speech synthesis written in C++☆212Updated last year
- Implementation code of non-parallel sequence-to-sequence VC☆248Updated last year
- This is the implementation of the Speaker Odyssey 2020 paper " Transforming spectrum and prosody for emotional voice conversion with non-…☆123Updated 4 years ago
- Keras version of Syncnet, by Joon Son Chung and Andrew Zisserman.☆51Updated 5 years ago
- This repository is an implementation of Transfer Learning from Speaker Verification to Multispeaker Text-To-Speech Synthesis (SV2TTS) wit…☆167Updated 4 years ago
- A toolkit for non-parallel voice conversion based on vector-quantized variational autoencoder☆170Updated 5 months ago
- ☆74Updated 3 years ago
- ☆254Updated 2 years ago
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆56Updated 5 years ago
- A PyTorch implementation of "Robust Universal Neural Vocoding"☆238Updated 4 years ago
- A data annotation pipeline to generate high-quality, large-scale speech datasets with machine pre-labeling and fully manual auditing.☆100Updated last year