ain-soph / ChatTTSLinks
ChatTTS is a generative speech model for daily dialogue.
☆23Updated 10 months ago
Alternatives and similar repositories for ChatTTS
Users that are interested in ChatTTS are comparing it to the libraries listed below
Sorting:
- F5-TTS 推理加速,速度提升约4倍!☆114Updated 10 months ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆107Updated 11 months ago
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆160Updated 2 years ago
- Huawei Grad-TTS for Chinese☆49Updated 2 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆110Updated 8 months ago
- IndexTTS Fine-tuning notebooks☆116Updated 5 months ago
- text to speech using autoregressive transformer and VITS☆247Updated last year
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆210Updated last week
- The reproduced code for Google's SoundStorm☆269Updated 2 years ago
- ☆68Updated 2 years ago
- TTS FrontEnd DataSet: Polyphone / Prosody / TextNormalization☆101Updated last year
- Unoffical implementation of Megatts2☆287Updated last year
- ☆139Updated 2 months ago
- 基于语言学本体构建,全面覆盖汉语多音字、音变等现象的高效中文TTS数据集。A linguistically grounded and comprehensive Chinese TTS dataset, efficiently covering Chinese polyph…☆45Updated last year
- All generative model in one for better TTS model☆74Updated last year
- Train the next generation of TTS systems.☆169Updated last year
- Text-audio foundation model from Boson AI☆110Updated 2 months ago
- ☆71Updated 2 years ago
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆210Updated last year
- ☆23Updated last year
- ☆33Updated 2 years ago
- Di♪♪Rhythm 2: Efficient And High Fidelity Song Generation Via Block Flow Matching☆105Updated last week
- CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO Fine-Tuning!☆98Updated 3 months ago
- Inference code for Audiodec-Valle-Wenetspeech4TTS☆50Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Updated last year
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆80Updated last year
- ☆124Updated 2 weeks ago
- 基于 g2pW 提升 pypinyin 的准确性☆101Updated 2 years ago
- Chinese and English Bilinguish G2P☆21Updated 2 years ago
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆178Updated last year