kyutai-labs / moshivisLinks
Kyutai with an "eye"
☆200Updated 3 months ago
Alternatives and similar repositories for moshivis
Users that are interested in moshivis are comparing it to the libraries listed below
Sorting:
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆258Updated 3 weeks ago
- Delayed Streams Modeling (DSM) is a flexible formulation for streaming, multimodal sequence-to-sequence learning.☆211Updated this week
- ☆480Updated last week
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆259Updated last month
- ☆407Updated last month
- The official GitHub Page for MiniMax☆45Updated 3 weeks ago
- ☆149Updated 2 months ago
- Collection of Open Source Speech Data☆159Updated 7 months ago
- ☆432Updated last month
- ☆238Updated 2 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆64Updated last month
- ☆188Updated last month
- GRadient-INformed MoE☆263Updated 9 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆212Updated last month
- ☆57Updated 4 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆111Updated 7 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya☆112Updated last month
- ☆163Updated 4 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆293Updated 2 weeks ago
- Lightweight toolkit package to train and fine-tune 1.58bit Language models☆80Updated last month
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆577Updated 2 months ago
- List of curated use cases built using Sesame's CSM 1B☆66Updated 3 weeks ago
- ☆95Updated 6 months ago
- Easy to use, High Performant Knowledge Distillation for LLMs☆86Updated last month
- G2P☆262Updated last month
- ☆577Updated this week
- ☆101Updated 9 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆294Updated 2 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆174Updated 2 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆911Updated 7 months ago