SJTMusicTeam / MusicGeneration
☆11Updated 3 years ago
Related projects
Alternatives and complementary repositories for MusicGeneration
- DNN based singing voice synthesis☆17Updated 6 years ago
- Codes and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆19Updated last year
- Singing Voice Speech modeling test☆35Updated 2 years ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆22Updated 7 months ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆32Updated this week
- Bilingual Singing Voice Synthesis☆14Updated 7 months ago
- ☆19Updated last year
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆38Updated 2 months ago
- MFA acoustic model training based on Opencpop☆12Updated 2 years ago
- Please visit https://thuhcsi.github.io/SnakeGAN/☆36Updated last year
- 60k hours of phoneme-aligned audio from audio books☆18Updated 3 months ago
- A minimum inference engine for DiffSinger☆34Updated 7 months ago
- Crawled from FreeMidi.org, MIDI files library including over twenty thousand files!☆29Updated 4 years ago
- A Chinese version of A Neural Parametric Singing Synthesizer☆12Updated 2 years ago
- An improved version of the Fastsinging singing voice synthesis system.☆20Updated 4 years ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆65Updated 4 months ago
- Labels for kiritan_singing data with extra resources for DNN-based singing voice synthesis (SVS) systems.☆29Updated 10 months ago
- A unified model for zero-shot singing voice conversion and synthesis☆21Updated last year
- Cover Song Detection System☆10Updated 5 years ago
- ☆21Updated 7 months ago
- music semantic understanding evaluation benchmark☆25Updated last year
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated last year
- A toy-like text-to-speech system for Chinese/Mandarin synthesis, inspired by Tacotron, FastSpeech2, and RefineGAN.☆15Updated 2 years ago
- An unofficial implementation of "UniCATS: A Unified Context-Aware Text-to-Speech Framework with Contextual VQ-Diffusion and Vocoding".☆22Updated last year
- Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications☆63Updated 2 months ago
- A piano music dataset with Audio, Symbolic and Text labels☆18Updated this week
- with alignment learning and continuous wavelet transform☆19Updated 2 years ago
- ☆44Updated last year