0417keito / UTAUTAILinks
UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)
☆13Updated last year
Alternatives and similar repositories for UTAUTAI
Users that are interested in UTAUTAI are comparing it to the libraries listed below
Sorting:
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 11 months ago
- My vocoder experiments☆31Updated 2 months ago
- A TTS Trained on Universal Audio.☆39Updated 4 months ago
- ☆19Updated last year
- speaker-disentangled speech linguistic content quantizer☆22Updated 6 months ago
- Phonemes and durations labeling based on whisper small☆11Updated last year
- GPT-style network for phonemization with durations of text☆67Updated last year
- ☆28Updated last year
- An AR+AR TTS attempt.☆17Updated 8 months ago
- ☆16Updated last year
- A Neural Audio Codec (NAC) for Universal Audio☆42Updated 4 months ago
- Just another FastSpeech 2 but cleaner code :)☆27Updated last year
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆30Updated 3 years ago
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆28Updated 2 years ago
- pytorch model for contexless-phoneme prediction from speech audio☆29Updated last month
- GPT for FACodec☆13Updated last year
- Voice conversion with just linear regression.☆24Updated 2 weeks ago
- Codebase and project page for EDMSound☆34Updated last year
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆52Updated 2 weeks ago
- Non Parallel Voice Conversion based on VITS☆24Updated 2 years ago
- Project of Singing Voice Conversion.☆15Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆52Updated last year
- ☆14Updated last year
- ☆11Updated 11 months ago
- ☆41Updated 2 years ago
- ☆23Updated 11 months ago
- MFA acoustic model training based on Opencpop☆15Updated 3 years ago
- 60k hours of phoneme-aligned audio from audio books☆19Updated last year
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Updated last year