kyutai-labs / moshivis
Kyutai with an "eye"
☆234 Updated 9 months ago
Alternatives and similar repositories for moshivis
Users who are interested in moshivis are comparing it to the libraries listed below.
- Liquid Audio - Speech-to-Speech audio models by Liquid AI ☆356 Updated 2 weeks ago
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆305 Updated 7 months ago
- ☆532 Updated 3 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆293 Updated 8 months ago
- ☆344 Updated 4 months ago
- ☆245 Updated 3 weeks ago
- ☆346 Updated 3 months ago
- ☆158 Updated 9 months ago
- Fast audio super resolution from 16 kHz to 48 kHz. ☆177 Updated 2 weeks ago
- The official GitHub Page for MiniMax ☆60 Updated 2 months ago
- ☆101 Updated last year
- Collection of Open Source Speech Data ☆164 Updated 3 months ago
- A highly compressive and high-quality neural audio codec for speech models. ☆209 Updated 2 weeks ago
- Tiny Model, Big Logic: Diversity-Driven Optimization Elicits Large-Model Reasoning Ability in VibeThinker-1.5B ☆560 Updated 2 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆347 Updated 9 months ago
- Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya ☆125 Updated 5 months ago
- ☆254 Updated 8 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ☆228 Updated 8 months ago
- AnyModal is a Flexible Multimodal Language Model Framework for PyTorch ☆103 Updated last year
- DACVAE ☆187 Updated 3 weeks ago
- AudioStory: Generating Long-Form Narrative Audio with Large Language Models ☆295 Updated 3 months ago
- An open source chat bot architecture for voice/vision (and multimodal) assistants, local (CPU/GPU bound) and remote (I/O bound) to run. ☆86 Updated 3 weeks ago
- GRadient-INformed MoE ☆264 Updated last year
- Video+code lecture on building nanoGPT from scratch ☆68 Updated last year
- ☆483 Updated 8 months ago
- Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024) ☆344 Updated last week
- ☆57 Updated 11 months ago
- A pipeline parallel training script for LLMs. ☆165 Updated 8 months ago
- List of curated use cases built using Sesame's CSM 1B ☆73 Updated 7 months ago
- ☆473 Updated 8 months ago