lalalune / LJSpeechTools
Tools for making LJSpeech datasets
ā17Updated 7 months ago
Related projects: ā
- ā62Updated 4 months ago
- š Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. š§š„š Advanced audio processing.ā194Updated 3 months ago
- VALL-E 2 reproductionā72Updated 2 months ago
- ā163Updated last month
- Your one-stop solution for voice dataset creationā106Updated 9 months ago
- ā97Updated this week
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.ioā66Updated 11 months ago
- Faster Tortoise inference then Tortoise Fast Forkā122Updated 4 months ago
- Community framework for training tortoiseā36Updated last year
- VoiceBox neural network implementationā88Updated last month
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.ā54Updated last month
- Application of MB-iSTFT-VITS components to vits2_pytorchā107Updated 2 months ago
- StyleTTS-ZS: Efficient High-Quality Zero-Shot Text-to-Speech Synthesis with Distilled Time-Varying Style Diffusionā72Updated this week
- Misc. tools/scripts that I made to use for tortoiseā17Updated last month
- RVC Onnx Infer- Upgraded and simplified-ishā19Updated 4 months ago
- Scripts for computing the Intelligibility and CLVP scores for evaluating TTS modelsā135Updated 9 months ago
- š š¤ Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloningā119Updated 2 months ago
- PyTorch code implementation of EfficientSpeech - to be presented at ICASSP2023.ā149Updated 6 months ago
- Google's SoundStorm: Efficient Parallel Audio Generationā115Updated last year
- Barkify: an unoffical training implementation of Bark TTS by suno-aiā122Updated last year
- Create training data for training a voice cloner for bark text to speech.ā44Updated last year
- An unofficial PyTorch implementation of VALL-Eā68Updated this week
- Audio datasets, easier.ā82Updated last year
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GPā¦ā74Updated 2 months ago
- [WIP] VoiceSmith makes training text to speech models easy.ā217Updated last year
- TorToiSe fine-tuning with DLASā211Updated last month
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.ā31Updated last year
- Go from raw audio files to a text-audio dataset automatically with OpenAI's Whisper.ā132Updated last year
- Audiogen Codecā116Updated 2 months ago
- The official Implementation of PeriodWave and PeriodWave-Turboā107Updated last month