manmay-nakhashi / TTS_dataset_creator
create dataset from list of youtube links easily
☆17Updated last year
Alternatives and similar repositories for TTS_dataset_creator:
Users that are interested in TTS_dataset_creator are comparing it to the libraries listed below
- Your one-stop solution for voice dataset creation☆117Updated last year
- An unofficial PyTorch implementation of VALL-E☆87Updated this week
- Faster Tortoise inference then Tortoise Fast Fork☆128Updated 9 months ago
- Google's SoundStorm: Efficient Parallel Audio Generation☆131Updated last year
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆57Updated last week
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…☆93Updated 4 months ago
- Create training data for training a voice cloner for bark text to speech.☆43Updated last year
- Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.☆15Updated last year
- Demo for 2022 ICASSP☆64Updated 2 years ago
- Unsupervised Rhythm Modeling for Voice Conversion☆80Updated last year
- Generative voice cloning model using TTS synthesis with state-of-the-art Zero-Shot Multi-Speaker functionality. An web api built with the…☆47Updated 2 years ago
- A simple voice conversion tool☆17Updated 2 years ago
- Finally, some decent sample sentences☆22Updated last year
- ☆20Updated 2 years ago
- StyleTTS 2 Optimized Training Fork☆22Updated 2 weeks ago
- Community framework for training tortoise☆40Updated 2 years ago
- audiolm-pytorch training code☆15Updated last year
- SLMGAN: Exploiting Speech Language Model Representations for Unsupervised Zero-Shot Voice Conversion in GANs☆15Updated last year
- ☆62Updated 6 months ago
- [Last Updated 2021] TTS from Cookie. Messy and experimental!☆43Updated last year
- ☆36Updated 5 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆66Updated 4 months ago
- 🔊 Text-prompted Generative Audio Model - With the ability to clone voices☆20Updated last year
- ☆68Updated last year
- SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.☆76Updated last month
- Misc. tools/scripts that I made to use for tortoise☆22Updated 6 months ago
- TalkNet 2: Non-Autoregressive Depth-Wise Separable Convolutional Model for Speech Synthesis with Explicit Pitch and Duration Prediction.☆88Updated 3 years ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated last year
- ☆28Updated last year
- Official Implementation of StyleTTS-VC☆175Updated last month