cyanbx / Prompt-Singer
Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).
☆87 · Updated last month
Related projects
Alternatives and complementary repositories for Prompt-Singer
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment ☆65 · Updated 4 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers ☆83 · Updated 3 weeks ago
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone ☆132 · Updated 8 months ago
- Robust Singing Voice Transcription and MIDI Extraction ☆58 · Updated this week
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE ☆54 · Updated 7 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2 ☆51 · Updated last year
- Music generation ☆24 · Updated 6 months ago
- Official code for the CVPR'24 paper Diff-BGM ☆47 · Updated last month
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability ☆93 · Updated 3 weeks ago
- ☆34 · Updated 5 months ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer ☆76 · Updated last year
- ☆65 · Updated last year
- ☆83 · Updated 2 months ago
- ☆70 · Updated 2 years ago
- CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model ☆184 · Updated 6 months ago
- ☆45 · Updated this week
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature ☆66 · Updated last month
- E2E TTS using Conditional Flow Matching (Experimental*) ☆66 · Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT ☆75 · Updated 2 months ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice ☆145 · Updated last month
- ☆66 · Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion ☆71 · Updated 7 months ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth… ☆81 · Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio) ☆49 · Updated last year
- The open source code for SimpleSpeech series ☆111 · Updated last month
- Train the next generation of TTS systems. ☆161 · Updated 2 months ago
- "Music Style Transfer with Time-Varying Inversion of Diffusion Models" ☆35 · Updated 4 months ago
- ☆62 · Updated last year
- The official implementation of EmoSphere-TTS ☆85 · Updated 3 months ago
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024) ☆51 · Updated 5 months ago