ishine / vc-lm
Convert anyone's voice timbre into thousands of different timbres
☆27Updated last year
Alternatives and similar repositories for vc-lm:
Users that are interested in vc-lm are comparing it to the libraries listed below
- Huawei Grad-TTS for Chinese☆48Updated last year
- ☆64Updated last year
- Streaming Text to Speech Web UI☆16Updated 10 months ago
- Chinese and English Bilingual G2P☆20Updated last year
- noise reduction☆17Updated 8 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- faster inference☆27Updated 2 months ago
- ☆20Updated 5 months ago
- Singing Voice Speech modeling test☆35Updated 2 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆92Updated 2 weeks ago
- ☆17Updated 5 months ago
- Unofficial PyTorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆93Updated 3 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 9 months ago
- E2E TTS using Conditional Flow Matching (Experimental*)☆69Updated last year
- TTS model based on VITS, FastSpeech2, and VISinger☆23Updated 2 years ago
- Sovits5 with RMVPE☆14Updated last year
- Torchaudio Forced Aligner for Mixed Chinese (Mandarin or Cantonese) and English.☆11Updated 3 months ago
- [ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation☆19Updated 7 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆85Updated last year
- ☆39Updated last year
- An open-source Kazakh Emotional Text-to-Speech Dataset☆27Updated last year
- Official implementation of "Automatic Tuning of Loss Trade-offs without Hyper-parameter Search in End-to-End Zero-Shot Speech Synthesis",…☆79Updated last year
- only rmvpe☆22Updated last year
- ☆56Updated last year
- ☆33Updated last year
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Updated last year
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆48Updated 8 months ago
- dog-can-sing-song☆22Updated 5 months ago