kyutai-labs / moshivis
Kyutai with an "eye"
☆230 Updated 9 months ago
Alternatives and similar repositories for moshivis
Users that are interested in moshivis are comparing it to the libraries listed below
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆304 Updated 2 months ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆306 Updated 6 months ago
- ☆241 Updated last week
- The official GitHub Page for MiniMax ☆60 Updated last month
- ☆532 Updated 2 months ago
- ☆333 Updated 4 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆291 Updated 7 months ago
- Fast audio super resolution from 16 kHz to 48 kHz. ☆92 Updated this week
- ☆159 Updated 8 months ago
- DACVAE ☆177 Updated last week
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ☆553 Updated last month
- Collection of Open Source Speech Data ☆163 Updated 2 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables" ☆71 Updated 4 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆346 Updated 8 months ago
- ☆345 Updated 2 months ago
- ☆101 Updated last year
- ☆251 Updated 7 months ago
- ☆57 Updated 10 months ago
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models ☆291 Updated 3 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ☆225 Updated 7 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆124 Updated 4 months ago
- ☆482 Updated 7 months ago
- ☆635 Updated last month
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- GRadient-INformed MoE ☆265 Updated last year
- GLM-ASR-Nano: A robust, open-source speech recognition model with 1.5B parameters ☆613 Updated 2 weeks ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024) ☆334 Updated 2 months ago
- A high quality and fast TTS repository ☆358 Updated last week
- List of curated use cases built using Sesame's CSM 1B ☆73 Updated 7 months ago
- Service for testing out the new Qwen2.5 omni model ☆61 Updated 7 months ago