swagger-coder / visinger_lab
A demo system written for the visinger SVS system; at its core it is still a music player
☆11Updated 2 years ago
Alternatives and similar repositories for visinger_lab
Users that are interested in visinger_lab are comparing it to the libraries listed below
- Text-to-speech with a learnable audio encoder, without alignment to a transcript reference☆53Updated 4 months ago
- ☆24Updated 2 years ago
- ☆68Updated 2 years ago
- Music generation☆25Updated last year
- ☆15Updated last year
- ☆55Updated 3 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆54Updated 2 years ago
- Streaming Vocos☆29Updated 7 months ago
- ☆23Updated last year
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆50Updated 4 months ago
- ☆18Updated 2 years ago
- ☆25Updated 7 months ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated 2 years ago
- ☆19Updated 3 years ago
- Speech-To-Text forced-alignment Speech processing Universal PERformance Benchmark☆35Updated 9 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆71Updated 2 years ago
- BEGANSing - Korean SVS + SVC + AudioSR☆11Updated last year
- a guide to grapheme-to-phoneme conversion and phoneme list for ace singing voice synthesis engine☆41Updated last year
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios"(AAAI 2026)☆86Updated last week
- GPT-style network for phonemization with durations of text☆68Updated last year
- The source code for the paper XiaoiceSing2 (interspeech2023)☆49Updated 2 years ago
- The implementation of paper "SpeechTripleNet: End-to-End Disentangled Speech Representation Learning for Content, Timbre and Prosody"☆34Updated 2 years ago
- faster inference☆28Updated last year
- An attempt to replicate the architecture of MiniMaxTTS described in its technical report☆49Updated 5 months ago
- Re-implementation of SLAM-ASR paper's experiment, using Phi-2 and Hubert☆21Updated last year
- Phonemes and durations labeling based on whisper small☆11Updated last year
- Chinese polyphone disambiguation for Text-to-Speech application☆42Updated last year
- Huawei Grad-TTS for Chinese☆51Updated 2 years ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆44Updated 11 months ago
- ☆19Updated last year