andrewsilva9 / tune_tortoise_autoregressor
Fine tuning the UnifiedVoice autoregressor for TortoiseTTS.
☆15Updated last year
Alternatives and similar repositories for tune_tortoise_autoregressor:
Users that are interested in tune_tortoise_autoregressor are comparing it to the libraries listed below
- ☆41Updated last year
- ☆26Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Updated 4 months ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆74Updated last year
- This repository contains prompts & best practices to annotate audio clips with a very high degree of details using Audio-Language-Models☆33Updated 6 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆69Updated last year
- ☆35Updated last year
- ☆33Updated last year
- Pytorch implementation of LearnableUpsamplingLayer (NaturalSpeech, Tan et al., 2022)☆54Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year
- ☆20Updated 2 years ago
- ☆71Updated last year
- All generative model in one for better TTS model☆67Updated 7 months ago
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- PyTorch Implementation of ViT-TTS (EMNLP'23)☆10Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2 and HiFi-GAN for End to End Text to Speech☆22Updated 2 years ago
- Official implementation of the paper "Laughter Synthesis using Pseudo Phonetic Tokens with a Large-scale In-the-wild Laughter Corpus" acc…☆75Updated last year
- ☆39Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- ☆38Updated 7 months ago
- 4G GPU & 10 Minutes for train☆12Updated last year
- An AR+AR TTS attempt.☆15Updated 3 months ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆29Updated 2 years ago
- ☆69Updated last year
- Temporary anonymous version☆22Updated last year
- ☆45Updated 2 years ago
- PyTorch Implementation of Daft-Exprt: Robust Prosody Transfer Across Speakers for Expressive Speech Synthesis☆56Updated 3 years ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆69Updated last year
- Finetuning VITS Efficiently☆32Updated last year