manmay-nakhashi / TTS_dataset_creator
create dataset from list of youtube links easily
☆17Updated last year
Alternatives and similar repositories for TTS_dataset_creator:
Users that are interested in TTS_dataset_creator are comparing it to the libraries listed below
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆60Updated 2 weeks ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 7 months ago
- StyleTTS 2 Optimized Training Fork☆26Updated last month
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆94Updated 5 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated last year
- Create training data for training a voice cloner for bark text to speech.☆44Updated last year
- A collection of all our phonemeizers for dataset construction and inference☆22Updated last month
- ☆56Updated 9 months ago
- ☆71Updated last year
- Your one-stop solution for voice dataset creation☆118Updated last year
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated 11 months ago
- Open TTS models, built for streaming on the edge☆39Updated 2 weeks ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆68Updated 5 months ago
- 🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition☆15Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆71Updated 4 months ago
- a Frontier Japanese Speech Generation net☆28Updated 2 weeks ago
- A simple voice conversion tool☆17Updated 3 years ago
- Finally, some decent sample sentences☆22Updated last year
- ☆26Updated last year
- Official implementation of the TTS model Lina-Speech☆157Updated 2 months ago
- ☆35Updated 11 months ago
- VoiceBox neural network implementation☆105Updated 7 months ago
- Zero-Shot Emotion Style Transfer☆43Updated 11 months ago
- ☆69Updated last year
- ☆24Updated last year
- Demo for 2022 ICASSP☆64Updated 2 years ago
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆22Updated last week
- Putting flows on top of neural transducers for better TTS☆62Updated 3 weeks ago