swagger-coder / visinger_lab
为visinger SVS系统写的展示系统~本质仍然是个音乐播放器
☆11Updated 2 years ago
Alternatives and similar repositories for visinger_lab
Users that are interested in visinger_lab are comparing it to the libraries listed below
Sorting:
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆28Updated 4 months ago
- ☆64Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆36Updated 3 months ago
- The source code for the paper XiaoiceSing2 (interspeech2023)☆47Updated last year
- Huawei Grad-TTS for Chinese☆50Updated last year
- Streaming Vocos☆24Updated 4 months ago
- ☆39Updated last year
- noise reduction☆17Updated 10 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆62Updated this week
- ☆55Updated 2 years ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆32Updated last year
- Self-supervised Generative LM-based Voice Conversion☆34Updated 2 weeks ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆19Updated 3 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆49Updated 9 months ago
- Singing Voice Speech modeling test☆35Updated 2 years ago
- An open-source Kazakh Emotional Text-to-Speech Dataset☆28Updated last year
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆70Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Updated 10 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆38Updated 11 months ago
- This is the official train-dev-test release of the Interspeech2024 Discrete Speech Representation Challenge.☆32Updated last year
- The Official Implementation of “Content-Dependent Fine-Grained Speaker Embedding for Zero-Shot Speaker Adaptation in Text-to-Speech Synth…☆85Updated 2 years ago
- ☆33Updated 2 months ago
- ☆20Updated 6 months ago
- Official implementation of Mozart's Touch: A Lightweight Multi-modal Music Generation Framework Based on Pre-Trained Large Models☆34Updated 2 months ago
- ☆13Updated 6 months ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆95Updated 4 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 11 months ago
- ☆56Updated last year