0417keito / UTAUTAI
UTAUTAI(Unrestricted Tune Automated Technology Artificial Interigence)
☆12Updated last year
Alternatives and similar repositories for UTAUTAI:
Users that are interested in UTAUTAI are comparing it to the libraries listed below
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated 5 months ago
- ☆29Updated last year
- ☆41Updated last year
- ☆18Updated 11 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 10 months ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 8 months ago
- ☆13Updated last year
- MFA acoustic model training based on Opencpop☆14Updated 2 years ago
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Project of Singing Voice Conversion.☆14Updated last year
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆48Updated last year
- ☆20Updated 6 months ago
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆53Updated last year
- [IJCAI'23] Learning to Speak from Text for Low-Resource TTS☆63Updated last year
- Source code of APNet2, a vocoder☆53Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆61Updated 2 months ago
- ☆22Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, alse for VITS; Use this to replace the ID to implement voice clone.☆29Updated 2 years ago
- iSeparate library for the SDX2023 challenge☆13Updated last year
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆75Updated 6 months ago
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆23Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated 2 years ago
- Automatic speech annotator processing speech with voice activaty detection, overlapping speech detection, speaker diarization and automat…☆32Updated 10 months ago
- Just another FastSpeech 2 but cleaner code :)☆26Updated 9 months ago
- Official implementation for FlowSep☆42Updated 3 months ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates [WIP]☆24Updated 2 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Updated last year
- Simple and lightweight Zero-shot Text-to-Speech (TTS) synthesis model☆19Updated last week