OpenT2S / LlamaVoice
LlamaVoice is a Llama-based large voice generation model that supports both inference and training.
☆231 Updated 7 months ago
Alternatives and similar repositories for LlamaVoice:
Users who are interested in LlamaVoice are comparing it to the libraries listed below:
- Real-time Speech-Text Foundation Model Toolkit (wip) ☆224 Updated 2 weeks ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆103 Updated this week
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆247 Updated last month
- We Speech Transcript based on LLM, in 300 lines of code. ☆157 Updated last month
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆261 Updated last month
- Collection of Open Source Speech Data ☆153 Updated 5 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3 ☆400 Updated 7 months ago
- ☆354 Updated 7 months ago
- ☆189 Updated last week
- VoiceBench: Benchmarking LLM-Based Voice Assistants ☆171 Updated 2 weeks ago
- An unofficial PyTorch implementation of the StreamVC (Real-Time Low-Latency Voice Conversion) ☆120 Updated 8 months ago
- Baichuan-Audio: A Unified Framework for End-to-End Speech Interaction ☆176 Updated last month
- Official implementation of the TTS model Lina-Speech