timedomain-tech / ACE_phonemesLinks
a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine
☆37Updated 6 months ago
Alternatives and similar repositories for ACE_phonemes
Users that are interested in ACE_phonemes are comparing it to the libraries listed below
Sorting:
- Please visit https://thuhcsi.github.io/SnakeGAN/☆37Updated 2 years ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆53Updated 2 years ago
- Synthesized singing voice demos of WeSinger 2 paper.☆27Updated 2 years ago
- ☆43Updated 11 months ago
- ☆101Updated 11 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆29Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆71Updated 2 weeks ago
- Bilingual Singing Voice Synthesis☆18Updated last year
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Updated last year
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆76Updated last year
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆54Updated last month
- Try to replicate the architecture of MiniMaxTTS mentioned in it's technical report☆34Updated 2 months ago
- only rmvpe☆22Updated 2 years ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated last year
- BigVGAN with Neural Source-Filter☆55Updated last year
- ☆87Updated 2 years ago
- ☆22Updated 2 years ago
- ☆13Updated last year
- The source code for the paper XiaoiceSing2 (interspeech2023)☆47Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆70Updated last year
- ☆25Updated 2 years ago
- MFA acoustic model training based on Opencpop☆15Updated 2 years ago
- RepVgg + HiFiGAN☆34Updated 2 years ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 3 years ago
- Official Implementation for the paper: A Variational Framework for Improving Naturalness in Generative Spoken Language Models☆22Updated last month
- ☆45Updated 2 years ago
- ☆28Updated last year
- Self-supervised Generative LM-based Voice Conversion☆41Updated 3 months ago