kyutai-labs / moshi-finetune
☆214 Updated last month
Alternatives and similar repositories for moshi-finetune
Users interested in moshi-finetune are comparing it to the libraries listed below.
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆246 Updated last month
- Real-time Speech-Text Foundation Model Toolkit (wip) ☆228 Updated last month
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ☆207 Updated this week
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆264 Updated 2 months ago
- Collection of Open Source Speech Data ☆157 Updated 6 months ago
- ☆359 Updated 8 months ago
- ☆156 Updated last week
- VoiceBench: Benchmarking LLM-Based Voice Assistants ☆196 Updated last week
- A simple, hackable text-to-speech system in PyTorch and MLX ☆159 Updated 2 months ago
- An espeak-compatible, permissively-licensed IPA phonemizer (G2P) based on DeepPhonemizer. Usable as a drop-in replacement for espeak's GP… ☆95 Updated 7 months ago
- ☆126 Updated last month
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability. ☆233 Updated 8 months ago
- ☆256 Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆148 Updated 3 weeks ago
- ☆287 Updated 11 months ago
- LiteASR: Efficient Automatic Speech Recognition with Low-Rank Approximation