0417keito / UTAUTAI
UTAUTAI (Unrestricted Tune Automated Technology Artificial Intelligence)
☆11Updated last year
Alternatives and similar repositories for UTAUTAI:
Users that are interested in UTAUTAI are comparing it to the libraries listed below
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆15Updated 4 months ago
- ☆41Updated last year
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated last year
- ☆18Updated 9 months ago
- Project of Singing Voice Conversion.☆14Updated last year
- Use VITS and Opencpop to develop singing voice synthesis; different from VISinger.☆35Updated last year
- ☆28Updated last year
- (R&D) Text to speech using phonemes as inputs and audio codec codes as outputs. Loosely based on MegaByte, VALL-E and Encodec.☆47Updated last year
- Aligner for text-to-speech☆14Updated 7 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 8 months ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆69Updated 4 months ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 6 months ago
- My vocoder experiments☆26Updated 4 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- Automatic speech annotator processing speech with voice activity detection, overlapping speech detection, speaker diarization and automat…☆32Updated 8 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆48Updated 2 weeks ago
- Zero-Shot Emotion Style Transfer☆41Updated 10 months ago
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆31Updated 3 months ago
- StyleTTS 2 Optimized Training Fork☆22Updated 2 weeks ago
- ☆22Updated last year
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- ☆34Updated 10 months ago
- Source code of APNet2, a vocoder☆54Updated last year
- ONNX deployment of the CREPE pitch tracker☆20Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆38Updated last year
- Contains the code associated with the ICLR submission for our text-to-speech diffusion model☆51Updated last year
- ☆13Updated last year
- FCTalker: Fine and Coarse Grained Context Modeling for Expressive Conversational Speech Synthesis (Accepted by ISCSLP'2024)☆22Updated last year
- Unofficial PyTorch implementation of HiFi-GAN with fast MISR.☆15Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆44Updated 7 months ago