Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment
☆172Feb 9, 2025Updated last year
Alternatives and similar repositories for ChatTTSPlus
Users that are interested in ChatTTSPlus are comparing it to the libraries listed below
Sorting:
- ☆36Sep 6, 2025Updated 6 months ago
- Data manipulation and transformation for audio signal processing, powered by PyTorch☆10Sep 30, 2024Updated last year
- Streaming Text to Speech Web UI☆22May 6, 2024Updated last year
- faster inference☆28Jan 20, 2025Updated last year
- (WIP)long form speech generatoins☆31Apr 2, 2025Updated 11 months ago
- Text-To-Speech for NotebookLM☆39Jul 20, 2025Updated 7 months ago
- FREECODEC: A DISENTANGLED NEURAL SPEECH CODEC WITH FEWER TOKENS☆24Sep 9, 2024Updated last year
- semantic tokenizer for speech and music☆21Jul 6, 2025Updated 8 months ago
- Wenet speech to text for react native☆10Nov 1, 2022Updated 3 years ago
- ✨✨Freeze-Omni: A Smart and Low Latency Speech-to-speech Dialogue Model with Frozen LLM☆368May 27, 2025Updated 9 months ago
- Offline Speaker Diarization with SenseVoice by Sherpa ONNX.☆15Dec 23, 2024Updated last year
- ☆23Oct 30, 2024Updated last year
- ☆204Sep 24, 2024Updated last year
- Official Code for ParrotTTS☆58Oct 13, 2024Updated last year
- SenseVoice-python: A enterprise-grade open source multi-language asr system from funasr opensource with onnxruntime☆109Oct 6, 2025Updated 5 months ago
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching☆121Mar 27, 2025Updated 11 months ago
- [ACL 2025] OZSpeech: One-step Zero-shot Speech Synthesis with Learned-Prior-Conditioned Flow Matching☆45Feb 9, 2025Updated last year
- Forced alignment decoder for Whisper.☆14Mar 13, 2024Updated last year
- LibriSpeech-Long is a benchmark dataset for long-form speech generation and processing. Released as part of "Long-Form Speech Generation …☆92Dec 28, 2024Updated last year
- ☆40Jul 15, 2025Updated 7 months ago
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆62Nov 1, 2024Updated last year
- Official implementation of the paper titled "Age and Gender Recognition Using a Convolutional Neural Network with a Specially Designed Mu…☆27Mar 5, 2024Updated 2 years ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- Generative Expressive Conversational Speech Synthesis (Accepted by MM'2024)☆78Nov 1, 2024Updated last year
- Source code for the EMNLP 2025 paper “DM-Codec: Distilling Multimodal Representations for Speech Tokenization”☆56Jun 1, 2025Updated 9 months ago
- The official repository of SpeechCraft dataset, a large-scale expressive bilingual speech dataset with natural language descriptions.☆183Updated this week
- Fast and High-Quality Zero-Shot Text-to-Speech with Flow Matching☆859Dec 2, 2025Updated 3 months ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆111Dec 20, 2024Updated last year
- A 6-million Audio-Caption Paired Dataset Built with a LLMs and ALMs-based Automatic Pipeline☆196Dec 13, 2024Updated last year
- ☆128Updated this week
- A large-scale speech corpus introduced in Spark-TTS, built from diverse open-source datasets for training text-to-speech (TTS) systems.☆105May 5, 2025Updated 10 months ago
- Towards Comprehensive Evaluation for End-to-End Spoken Dialogue Models☆50Sep 2, 2025Updated 6 months ago
- A enterprise-grade Voice Activity Detector from modelscope and funasr.☆129Apr 26, 2023Updated 2 years ago
- Grapheme-to-Phoneme for Mixed Chinese (Mandarin or Cantonese) and English.☆114Dec 2, 2025Updated 3 months ago
- F5-TTS 推理加速,速度提升约4倍!☆123Jan 6, 2025Updated last year
- Awesome Neural Codec Models, Text-to-Speech Synthesizers & Speech Language Models☆241Dec 18, 2025Updated 2 months ago
- Chinese and English Bilinguish G2P☆22Jul 16, 2023Updated 2 years ago
- Silero VAD(ncnn): pre-trained enterprise-grade Voice Activity Detector.☆24Aug 21, 2024Updated last year
- [ACMMM'2024] Generative Expressive Conversational Speech Synthesis☆44Oct 28, 2024Updated last year