MYZY-AI / Muyan-TTSLinks
☆400Updated 2 weeks ago
Alternatives and similar repositories for Muyan-TTS
Users that are interested in Muyan-TTS are comparing it to the libraries listed below
Sorting:
- ☆377Updated 3 weeks ago
- Added vLLM support to IndexTTS for faster inference.☆171Updated last week
- GPT-4o-level, real-time spoken dialogue system.☆327Updated 4 months ago
- Dolphin is a multilingual, multitask ASR model jointly trained by DataoceanAI and Tsinghua University.☆499Updated 2 weeks ago
- OSUM: Open Speech Understanding Model, open-sourced by ASLP@NPU.☆369Updated this week
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆567Updated last month
- An Open-Sourced LLM-empowered Foundation TTS System☆715Updated last week
- InspireMusic: Music, Song, Audio Generation.☆1,107Updated last week
- ☆197Updated 8 months ago
- Open source inference code for Rev's model☆404Updated last month
- 使用vllm加速cosyvoice2的推理☆312Updated last month
- G2P☆248Updated last month
- We Speech Transcript based on LLM, in 300 lines of code.☆162Updated last month
- A Fast TTS Engine☆502Updated 4 months ago
- Extension of ChatTTS, 3x Faster on Windows, Support Voice Cloning and Mobile Deployment☆158Updated 3 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆410Updated 8 months ago
- A lightweight end-to-end text-to-speech model☆115Updated 3 months ago
- A Low-Latency, Lightweight and High-Performance Streaming VAD☆394Updated last week
- F5-TTS 推理加速,速度提升约4倍!☆92Updated 4 months ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆233Updated 9 months ago
- Kyutai with an "eye"☆197Updated 2 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆731Updated 2 months ago
- Interface for OuteTTS models.☆1,283Updated last week
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆303Updated last week
- Running the F5-TTS by ONNX Runtime☆155Updated 2 weeks ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction☆195Updated 3 months ago
- ☆160Updated 6 months ago
- Real-time Speech-Text Foundation Model Toolkit (wip)☆228Updated 2 months ago
- ☆359Updated 10 months ago
- [ICML 2025] SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation☆227Updated 2 months ago