swagger-coder / visinger_labLinks
为visinger SVS系统写的展示系统~本质仍然是个音乐播放器
☆11Updated 2 years ago
Alternatives and similar repositories for visinger_lab
Users that are interested in visinger_lab are comparing it to the libraries listed below
Sorting:
- ☆68Updated 2 years ago
- ☆24Updated 2 years ago
- ☆55Updated 3 years ago
- Music generation☆25Updated last year
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆54Updated 2 years ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆80Updated last month
- This repo is text to speech with learnable audio encoder without alignment with transcript reference☆53Updated 4 months ago
- STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation☆70Updated 2 months ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆91Updated 6 months ago
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆41Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆55Updated 3 years ago
- Make-An-Audio-3: Transforming Text/Video into Audio via Flow-based Large Diffusion Transformers☆118Updated 8 months ago
- g2p for english tts☆19Updated 3 years ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated 2 years ago
- Huawei Grad-TTS for Chinese☆51Updated 2 years ago
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆50Updated 4 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Updated last year
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆119Updated last year
- ☆23Updated last year
- 基于vits fastspeech2 visinger的tts模型☆24Updated 2 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Updated last year
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆61Updated last year
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆122Updated 5 months ago
- An Open-Source Project to Unify Audio Processing and Generation☆174Updated this week
- Implementation of Frieren: Efficient Video-to-Audio Generation Network with Rectified Flow Matching (NeurIPS'24)☆59Updated 9 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆88Updated last year
- A unified tokenizer that is capable of both extracting semantic information and enabling high-fidelity audio reconstruction.☆131Updated 4 months ago
- The source code for the paper XiaoiceSing2 (interspeech2023)☆49Updated 2 years ago
- ☆39Updated 2 years ago