ishine / vc-lm
将任意人的音色转换为成千上万种不同音色
☆26Updated last year
Alternatives and similar repositories for vc-lm:
Users that are interested in vc-lm are comparing it to the libraries listed below
- Chinese and English Bilinguish G2P☆20Updated last year
- ☆20Updated 4 months ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆92Updated 2 months ago
- ☆24Updated this week
- UMETTS: A Unified Framework for Emotional Text-to-Speech Synthesis with Multimodal Prompts☆19Updated 2 months ago
- 单独维护的中文TTS☆35Updated 2 years ago
- ☆64Updated last year
- An open-source Kazakh Emotional Text-to-Speech Dataset☆27Updated 11 months ago
- g2p for english tts☆18Updated 2 years ago
- Bert-VITS2项目bug多且教程不友好。本proj尽可能修复了Bert-vits2项目的bug,并且可一键启动训练。仅需50条目标说话人语音,获得稳定、快速的TTS模型。☆49Updated 5 months ago
- G2pw's inference speed is accelerated by about 8-10 times. Change loop generated predictive data to only once and model loop prediction b…☆14Updated last year
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake…☆57Updated last year
- Chinese polyphone disambiguation for Text-to-Speech application☆31Updated 8 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆51Updated last year
- CTC decoder with hotwords for ASR.☆16Updated last month
- Identify speakers with stable voice timbre.☆28Updated 8 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆86Updated last week
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion☆34Updated 8 months ago
- MnTTS: An Open-Source Mongolian Text-to-Speech Synthesis Dataset and Accompanied Baseline. (Accepted by IALP'2022)☆18Updated 2 years ago
- Huawei Grad-TTS for Chinese☆46Updated last year
- cpp inference for EmotiVoice☆15Updated last year
- F5-TTS 推理加速,速度提升约4倍!☆48Updated last month
- E2E TTS using Conditional Flow Matching (Experimental*)☆69Updated last year
- noise reduction☆17Updated 8 months ago
- Glow-TTS with Stochastic Duration Predictor and Stochastic Pitch Predictor☆18Updated last year
- Training code for MaskGCT-T2S model.☆18Updated 2 months ago
- VoiceBank-2023 is the speech corpus specially designed for constructing personalized Mandarin text-to-speech (TTS) systems.☆39Updated last year
- Just another FastSpeech 2 but cleaner code :)☆26Updated 8 months ago
- DiTTo-TTS: Diffusion Transformers for Scalable Text-to-Speech without Domain-Specific Factors☆18Updated 3 weeks ago
- Implementation of DCComix TTS: An End-to-End Expressive TTS with Discrete Code Collaborated with Mixer☆75Updated last year