cyanbx / Prompt-Singer
Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).
☆85Updated 3 weeks ago
Related projects ⓘ
Alternatives and complementary repositories for Prompt-Singer
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆65Updated 4 months ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆81Updated 2 weeks ago
- Robust Singing Voice Transcription and MIDI Extraction☆55Updated 3 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆54Updated 7 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- Music generation☆24Updated 6 months ago
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone☆129Updated 7 months ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆91Updated last week
- ☆70Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆76Updated last year
- ☆34Updated 4 months ago
- CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆183Updated 6 months ago
- ☆41Updated 3 weeks ago
- [InterSpeech 24] FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter☆78Updated 4 months ago
- All generative model in one for better TTS model☆66Updated 2 months ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆128Updated 3 weeks ago
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆81Updated last year
- The source code for the paper XiaoiceSing2 (interspeech2023)☆44Updated 9 months ago
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆49Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆79Updated last year
- ☆77Updated 2 months ago
- The open source code for SimpleSpeech series☆108Updated last month
- Emotion Rendering for Conversational Speech Synthesis with Heterogeneous Graph-Based Context Modeling (Accepted by AAAI'2024)☆51Updated 4 months ago
- ☆13Updated 5 months ago
- ☆65Updated last year
- E2E TTS using Conditional Flow Matching (Experimental*)☆66Updated last year
- ☆66Updated last year
- ☆62Updated last year
- Implementation of Emo-StarGAN☆46Updated 10 months ago
- ☆39Updated last year