kyutai-labs / moshivis
Kyutai with an "eye"
☆217 Updated 5 months ago
Alternatives and similar repositories for moshivis
Users who are interested in moshivis are comparing it to the libraries listed below.
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆274 Updated 2 months ago
- ☆514 Updated last week
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆268 Updated 3 months ago
- ☆102 Updated 11 months ago
- Frontier Open-Source Text-to-Speech ☆1,740 Updated this week
- ☆155 Updated 4 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆117 Updated 2 weeks ago
- ☆282 Updated last month
- The official GitHub Page for MiniMax ☆50 Updated last month
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024) ☆275 Updated 2 weeks ago
- ☆215 Updated 3 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables" ☆66 Updated last week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆304 Updated 4 months ago
- ☆57 Updated 6 months ago
- Collection of Open Source Speech Data ☆159 Updated 9 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆102 Updated 8 months ago
- ☆633 Updated 3 weeks ago
- GRadient-INformed MoE ☆265 Updated 11 months ago