timedomain-tech / ACE_phonemes
a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine
☆36Updated 2 months ago
Alternatives and similar repositories for ACE_phonemes:
Users that are interested in ACE_phonemes are comparing it to the libraries listed below
- ☆22Updated 11 months ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- Singing Voice Speech modeling test☆35Updated 2 years ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆21Updated last year
- 60k hours of phoneme-aligned audio from audio books☆18Updated 8 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆25Updated 11 months ago
- ☆12Updated 2 months ago
- ☆22Updated last year
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…☆36Updated 6 months ago
- ☆38Updated 6 months ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- Pitch Controllable DDSP Vocoders☆71Updated 4 months ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆25Updated 2 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆57Updated last month
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆66Updated 11 months ago
- A toy-like Text-to-Speech for Chinese/Mandarin synthesize, inspired by Tacotron & FastSpeech2 & RefineGAN.☆15Updated 2 years ago
- ☆15Updated last month
- Adaptive Vocoder for Custom Voice☆59Updated 2 years ago
- The source code for the paper XiaoiceSing2 (interspeech2023)☆48Updated last year
- ☆45Updated last year
- Bilingual Singing Voice Synthesis☆18Updated last year
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆67Updated 8 months ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆79Updated 3 months ago
- ☆43Updated 9 months ago
- ☆22Updated last month
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆34Updated last year
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- Cover Song Detection System☆10Updated 5 years ago