TiffanyBlews / MozartsTouch
Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models
☆18 · Updated last week
Related projects:
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment ☆62 · Updated 2 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers ☆68 · Updated 2 months ago
- Music generation ☆24 · Updated 4 months ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24) ☆61 · Updated 2 months ago
- The source code for the paper XiaoiceSing2 (Interspeech 2023) ☆43 · Updated 8 months ago
- "Music Style Transfer with Time-Varying Inversion of Diffusion Models" ☆31 · Updated last month
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆64 · Updated 5 months ago
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature ☆44 · Updated last month
- Unofficial PyTorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (… ☆58 · Updated 5 months ago
- A guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine ☆31 · Updated 11 months ago
- Trying to reproduce Suno v3 ☆23 · Updated 5 months ago
- Source code of APNet2, a vocoder ☆49 · Updated 9 months ago