lalalune / LJSpeechTools

Tools for making LJSpeech datasets

☆17

Related projects: ⓘ

NeuralVox / StyleTTS2
☆62Updated 4 months ago
davidmartinrius / speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆194Updated 3 months ago
ex3ndr / supervoice-vall-e-2
VALL-E 2 reproduction
☆72Updated 2 months ago
IIEleven11 / StyleTTS2FineTune
☆163Updated last month
rioharper / VocalForge
Your one-stop solution for voice dataset creation
☆106Updated 9 months ago
voicefixer / voicefixer
☆97Updated this week
0417keito / VALL-E-X-Trainer-by-CustomData
An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io
☆66Updated 11 months ago
manmay-nakhashi / tortoise-tts-fastest
Faster Tortoise inference then Tortoise Fast Fork
☆122Updated 4 months ago
rebotnix / Tortoise-TTS-Training
Community framework for training tortoise
☆36Updated last year
ex3ndr / supervoice-voicebox
VoiceBox neural network implementation
☆88Updated last month
tuanh123789 / Train_Hifigan_XTTS
This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.
☆54Updated last month
FENRlR / MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
☆107Updated 2 months ago
yl4579 / StyleTTS-ZS
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
☆72Updated this week
JarodMica / tortoise_dataset_tools
Misc. tools/scripts that I made to use for tortoise
☆17Updated last month
codename0og / RVC_Onnx_Infer
RVC Onnx Infer- Upgraded and simplified-ish
☆19Updated 4 months ago
neonbjb / tts-scores
Scripts for computing the Intelligibility and CLVP scores for evaluating TTS models
☆135Updated 9 months ago
sidharthrajaram / StyleTTS2
🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning
☆119Updated 2 months ago
roatienza / efficientspeech
PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.
☆149Updated 6 months ago
rishikksh20 / SoundStorm-pytorch
Google's SoundStorm: Efficient Parallel Audio Generation
☆115Updated last year
anyvoiceai / Barkify
Barkify: an unoffical training implementation of Bark TTS by suno-ai
☆122Updated last year
gitmylo / bark-data-gen
Create training data for training a voice cloner for bark text to speech.
☆44Updated last year
e-c-k-e-r / vall-e
An unofficial PyTorch implementation of VALL-E
☆68Updated this week
devilismyfriend / ozen-toolkit
Audio datasets, easier.
☆82Updated last year
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆74Updated 2 months ago
dunky11 / voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
☆217Updated last year
152334H / DL-Art-School
TorToiSe fine-tuning with DLAS
☆211Updated last month
jerryuhoo / VISinger
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.
☆31Updated last year
miguelvalente / whisperer
Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.
☆132Updated last year
AudiogenAI / agc
Audiogen Codec
☆116Updated 2 months ago
sh-lee-prml / PeriodWave
The official Implementation of PeriodWave and PeriodWave-Turbo
☆107Updated last month