ain-soph / ChatTTSLinks
ChatTTS is a generative speech model for daily dialogue.
☆22Updated 6 months ago
Alternatives and similar repositories for ChatTTS
Users that are interested in ChatTTS are comparing it to the libraries listed below
Sorting:
- Huawei Grad-TTS for Chinese☆50Updated last year
- ☆65Updated last year
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆148Updated last year
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆99Updated 6 months ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆166Updated last year
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆103Updated 3 months ago
- text to speech using autoregressive transformer and VITS☆243Updated last year
- Train the next generation of TTS systems.☆165Updated 9 months ago
- F5-TTS 推理加速,速度提升约4倍!☆100Updated 6 months ago
- Unoffical implementation of Megatts2☆286Updated last year
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆92Updated last year
- ☆21Updated 8 months ago
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆224Updated last week
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone☆143Updated last year
- All generative model in one for better TTS model☆71Updated 10 months ago
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆210Updated last year
- 基于语言学本体构建,全面覆盖汉语多音字、音变等现象的高效中文TTS数据集。A linguistically grounded and comprehensive Chinese TTS dataset, efficiently covering Chinese polyph…☆35Updated 10 months ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆110Updated 5 months ago
- 基于 g2pW 提升 pypinyin 的准确性☆94Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆98Updated last year
- The reproduced code for Google's SoundStorm☆267Updated last year
- ☆71Updated last year
- Chinese and English Bilinguish G2P☆21Updated last year
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆102Updated 5 months ago
- [TAFFC 2025] The official implementation of EmoSphere++: Emotion-Controllable Zero-Shot Text-to-Speech via Emotion-Adaptive Spherical Vec…☆96Updated 2 months ago
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆138Updated 8 months ago
- ☆85Updated last month
- ☆108Updated 7 months ago
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆68Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆81Updated 10 months ago