0417keito / UTAUTAI
UTAUTAI (Unrestricted Tune Automated Technology Artificial Intelligence)
☆13Updated 2 years ago
Alternatives and similar repositories for UTAUTAI
Users who are interested in UTAUTAI are comparing it to the repositories listed below.
- My vocoder experiments☆31Updated 3 months ago
- ☆19Updated last year
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆17Updated last year
- ☆16Updated last year
- Voice conversion with just linear regression.☆28Updated last month
- An implementation of Charactr, Inc's "WavThruVec: Latent speech representation as intermediate features for neural speech synthesis"☆29Updated 2 years ago
- speaker-disentangled speech linguistic content quantizer☆22Updated 7 months ago
- A TTS Trained on Universal Audio.☆39Updated 4 months ago
- ☆41Updated 2 years ago
- ☆28Updated last year
- Just another FastSpeech 2 but cleaner code :)☆27Updated last year
- An AR+AR TTS attempt.☆18Updated 9 months ago
- Incorporating AutoVocoder to MB-iSTFT-VITS☆48Updated 2 years ago
- Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the speaker ID to implement voice cloning.☆30Updated 3 years ago
- GPT for FACodec☆13Updated last year
- A Neural Audio Codec (NAC) for Universal Audio☆42Updated 5 months ago
- 60k hours of phoneme-aligned audio from audio books☆19Updated last year
- Phonemes and durations labeling based on whisper small☆11Updated last year
- Codebase and project page for EDMSound☆35Updated last year
- Multispeaker Community Vocoder Model for DiffSinger☆38Updated 2 months ago
- ☆14Updated last year
- Official repository for NAST: Noise Aware Speech Tokenization for Speech Language Models (Interspeech 2024) https://arxiv.org/abs/2406.11…☆46Updated last year
- Accompanying repository for the paper "DiffVox: A Differentiable Model for Capturing and Analysing Professional Effects Distributions"☆35Updated this week
- A high-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆31Updated 2 years ago
- Collection of scripts from mHuBERT-147.☆31Updated 11 months ago
- ☆25Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆65Updated last year
- PyTorch implementation of Miipher-2 [2025] which is a speech restoration model by Google DeepMind☆52Updated last month
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆40Updated last year
- Official Code for ParrotTTS☆57Updated last year