ai16z / LJSpeechTools

Tools for making LJSpeech datasets

☆21

Related projects

Alternatives and complementary repositories for LJSpeechTools

ex3ndr / supervoice-vall-e-2
VALL-E 2 reproduction
☆87 Updated 4 months ago
tonychenxyz / emoknob
This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…
☆43 Updated last month
rioharper / VocalForge
Your one-stop solution for voice dataset creation
☆112 Updated 11 months ago
anyvoiceai / Barkify
Barkify: an unofficial training implementation of Bark TTS by suno-ai
☆126 Updated last year
manmay-nakhashi / tortoise-tts-fastest
Faster Tortoise inference than Tortoise Fast Fork
☆122 Updated 7 months ago
gitmylo / bark-data-gen
Create training data for training a voice cloner for bark text to speech.
☆44 Updated last year
codename0og / RVC_Onnx_Infer
RVC Onnx Infer - Upgraded and simplified-ish
☆19 Updated 6 months ago
yl4579 / StyleTTS-ZS
StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusion
☆159 Updated last month
NVIDIA / RAD-MMM
A TTS model that makes a speaker speak new languages
☆75 Updated 5 months ago
0417keito / VALL-E-X-Trainer-by-CustomData
An open-source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io
☆66 Updated last year
e-c-k-e-r / vall-e
An unofficial PyTorch implementation of VALL-E
☆77 Updated this week
tuanh123789 / Train_Hifigan_XTTS
An implementation for training the HiFi-GAN part of the XTTSv2 model using Coqui/TTS.
☆61 Updated last week
rendchevi / daisy-tts
🌼 Daisy-TTS: Simulating Wider Spectrum of Emotions via Prosody Embedding Decomposition
☆15 Updated 8 months ago
davidmartinrius / speech-dataset-generator
🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.
☆209 Updated 5 months ago
Takaaki-Saeki / ssl_speech_restoration
SelfRemaster: SSL Speech Restoration
☆85 Updated 10 months ago
anhnh2002 / XTTSv2-Finetuning-for-New-Languages
☆73 Updated last month
maxrmorrison / pyfoal
Python forced alignment
☆73 Updated 7 months ago
AudiogenAI / agc
Audiogen Codec
☆127 Updated 4 months ago
clement-pages / gryannote
Provides custom Gradio components that make diarization-based audio labeling easier and faster.
☆45 Updated 2 weeks ago
myshell-ai / DreamVoice
☆81 Updated 2 months ago
FENRlR / MB-iSTFT-VITS2
Application of MB-iSTFT-VITS components to vits2_pytorch
☆117 Updated this week
ORI-Muchim / AudioSR-Upsampling
AudioSR-Upsampling (any -> 48kHz)
☆38 Updated 9 months ago
NeuralVox / OpenPhonemizer
An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP…
☆83 Updated last month
rishikksh20 / SoundStorm-pytorch
Google's SoundStorm: Efficient Parallel Audio Generation
☆129 Updated last year
merlresearch / cocktail-fork-separation
Baseline multi-resolution cross network model trained using the Divide and Remaster Dataset
☆77 Updated 9 months ago
Edresson / ZS-TTS-Evaluation
☆32 Updated 2 months ago
unilight / seq2seq-vc
A sequence-to-sequence voice conversion toolkit.
☆86 Updated 4 months ago
NeuralNotW0rk / LoRAW
Flexible LoRA Implementation to use with stable-audio-tools
☆48 Updated 2 months ago
sakemin / demucs_batch-multigpu
[Batching/MultiGPU/DataLoader Implemented] Code for the paper Hybrid Spectrogram and Waveform Source Separation
☆22 Updated last year