youmebangbang / TTS-dataset-tools
Automatically generates TTS dataset using audio and associated text. Make cuts under a custom length. Uses Google Speech to text API to perform diarization and transcription or aeneas to force align text to audio.
☆50Updated 2 years ago
Related projects: ⓘ
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆86Updated 3 years ago
- ☆25Updated this week
- Create training data for training a voice cloner for bark text to speech.☆44Updated last year
- Your one-stop solution for voice dataset creation☆106Updated 9 months ago
- Community framework for training tortoise☆36Updated last year
- Neural Voice Cloning with a few voice samples, using the speaker adaptation method. Speaker adaptation is based on fine-tuning a multi-sp…☆57Updated 5 years ago
- ☆64Updated 3 years ago
- Demo for 2022 ICASSP☆64Updated 2 years ago
- A python library to generate speech dataset from Youtube videos☆35Updated 3 months ago
- GradioUI for TortoiseTTS voice generation☆33Updated 11 months ago
- Simple text to phonemes converter for multiple languages☆21Updated last year
- ☆20Updated last year
- demo page https://MingjieChen.github.io/dygan-vc☆67Updated 2 years ago
- A simple voice conversion tool☆15Updated 2 years ago
- ☆23Updated this week
- A gui to help make a text to speech dataset.☆18Updated last year
- Audio datasets, easier.☆82Updated last year
- Collect Voice Conversion researches☆90Updated this week
- Create an LJSpeech structured voice dataset on wave input☆16Updated 2 months ago
- Timbre transfer with variational autoencoding and cycle-consistent adversarial networks. Able to transfer the timbre of an audio source t…☆65Updated 2 years ago
- ☆128Updated last year
- Finally, some decent sample sentences☆21Updated 9 months ago
- Tools to create your own voice dataset for TTS training☆58Updated 3 years ago
- DLAS - A configuration-driven trainer for generative models☆136Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆46Updated last year
- Text prompt steered synthetic audio generators☆44Updated 9 months ago
- Interface for Controllable Expressive Talking Machine☆37Updated 8 months ago
- ☆56Updated this week
- Coqui AI TTS plugin☆65Updated last week
- Demo for 2022 Interspeech☆29Updated 2 years ago