GiantAILab / DiaMoE-TTSLinks
Official code for"DiaMoE-TTS: A Unified IPA-based Dialect TTS Framework with Mixture-of-Experts and Parameter-Efficient Zero-Shot Adaptation"
☆210Updated last month
Alternatives and similar repositories for DiaMoE-TTS
Users that are interested in DiaMoE-TTS are comparing it to the libraries listed below
Sorting:
- FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.☆235Updated 2 months ago
- ☆23Updated last year
- We Speech Toolkit, LLM based Speech Toolkit for Speech Understanding, Generation, and Interaction☆172Updated this week
- An Open-Source Project to Unify Audio Processing and Generation☆159Updated 2 weeks ago
- This project is to train an RWKV LLM for TTS generation which compatible to other TTS engine(like fish/cosy/chattts).☆92Updated 3 months ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆110Updated last year
- ☆41Updated 11 months ago
- ☆105Updated 3 months ago
- ☆112Updated 2 months ago
- A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.☆101Updated 8 months ago
- OpenS2S : Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model☆101Updated 5 months ago
- CosyVoice_DPO_NOTES: Supercharge Your Cosyvoice model with Cutting-Edge DPO Fine-Tuning!☆110Updated 5 months ago
- Streamable Text-to-Speech model using a language modeling approach, without vector quantization☆106Updated 7 months ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆111Updated last year
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆216Updated 10 months ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆113Updated last month
- ☆94Updated 2 months ago
- Official code for "F5R-TTS: Improving Flow-Matching based Text-to-Speech with Group Relative Policy Optimization"☆141Updated 7 months ago
- Curated list for papers, codes and resources related to Text-to-Audio (TTA) Generation☆68Updated this week
- Official Code for ParrotTTS☆58Updated last year
- Official code for "EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting"☆103Updated 2 months ago
- An instruct text-to-speech solution based on LLaSA and CosyVoice2 developed by the ASLP lab and collaborators.☆24Updated this week
- The official repository for the paper “NonVerbalSpeech-38K: A Scalable Pipeline for Enabling Non-Verbal Speech Generation and Understandi…☆62Updated 2 weeks ago
- ☆166Updated 4 months ago
- Text-audio foundation model from Boson AI☆116Updated 4 months ago
- MOSS-Speech is a true speech-to-speech large language model without text guidance.☆115Updated last month
- [NeurIPS' 25] Benchmark for evaluating TTS models on complex prosodic, expressiveness, and linguistic challenges.☆182Updated last month
- All generative model in one for better TTS model☆74Updated last year
- Official implementation of paper: Shallow Flow Matching for Coarse-to-Fine Text-to-Speech Synthesis☆48Updated 3 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆77Updated last year