yihuitang / StyleTTS_Mandarin
Implementation of StyleTTS for Mandarin
☆10Updated last year
Related projects: ⓘ
- ☆28Updated this week
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆43Updated 2 months ago
- Predict prosody labels for Chinese sentences.☆41Updated 2 years ago
- Chinese Prosodic Structure Prediction☆10Updated 5 years ago
- AutoPrep: An Automatic Preprocessing Framework for In-the-Wild Speech Data☆28Updated 8 months ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆79Updated 7 months ago
- ☆30Updated last year
- ☆74Updated 2 years ago
- Unofficial pytorch reproduction for the paper "Utilizing Neural Transducers for Two-Stage Text-to-Speech via Semantic Token Prediction" (…☆58Updated 5 months ago
- ☆62Updated 8 months ago
- ☆35Updated 7 months ago
- [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS☆60Updated 6 months ago
- Materials accompanying the paper "Phonological features for 0-shot multilingual speech synthesis"☆31Updated 4 years ago
- Objective metrics used in several text-to-speech (TTS) papers.☆46Updated 2 years ago
- The implementation of g2pL with a new open dataset.☆15Updated last year
- HiFi-GAN: Generative Adversarial Networks for Efficient and High Fidelity Speech Synthesis☆37Updated 3 years ago
- Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice☆58Updated last week
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech☆46Updated 2 years ago
- TTS-frontend with Bert and CRF/lstm (For Tacotron)☆48Updated 4 years ago
- PyTorch Implementation of Robust and fine-grained prosody control of end-to-end speech synthesis☆41Updated 2 years ago
- [InterSpeech 2020] "Improving the Speaker Identity of Non-Parallel Many-to-Many VoiceConversion with Adversarial Speaker Recognition" by …☆39Updated last year
- The open source code for SimpleSpeech series☆85Updated last month
- Chinese Text Normalization and Dataset☆78Updated 2 years ago
- a curated list of speech datasets (110+ datasets, 75+ easy to download)☆76Updated last year
- High fidelity, lightweight, end-to-end, streaming, convolution-based neural audio codec☆63Updated 3 months ago
- TTS Text Analyzer☆31Updated last year
- Evaluation Protocol for Large-Scale Zero-Shot TTS Literature☆44Updated last month
- ☆60Updated last year
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆32Updated 2 years ago
- ☆31Updated last year