kyutai-labs / moshivis
Kyutai with an "eye"
☆160 Updated last week
Alternatives and similar repositories for moshivis:
Users who are interested in moshivis are comparing it to the libraries listed below.
- Google's NotebookLM but local ☆167 Updated last week
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆224 Updated last week
- ☆210 Updated 2 weeks ago
- G2P ☆182 Updated this week
- ☆142 Updated last month
- Collection of Open Source Speech Data ☆152 Updated 4 months ago
- ☆74 Updated 6 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On… ☆186 Updated last week
- ☆99 Updated 7 months ago
- ☆56 Updated last month
- Free Search is a wrapper on top of publicly available SearXNG instances to give free internet access as a REST API. ☆147 Updated this week
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability. ☆232 Updated 7 months ago
- Video+code lecture on building nanoGPT from scratch ☆66 Updated 9 months ago
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps ☆108 Updated 4 months ago
- Code release for "LLMs can see and hear without any training" ☆231 Updated last month
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆97 Updated last week
- ☆92 Updated 3 months ago
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆238 Updated 3 weeks ago
- GPT-4o-level, real-time spoken dialogue system. ☆302 Updated 2 months ago
- A pipeline parallel training script for LLMs. ☆136 Updated this week
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model". ☆893 Updated 5 months ago
- A lightweight end-to-end text-to-speech model ☆111 Updated last month
- ☆134 Updated last month
- ☆171 Updated 7 months ago
- II-Researcher: a new open-source framework designed to aid building search / research agents ☆107 Updated this week
- ☆56 Updated 4 months ago
- DRT-o1: Optimized Deep Reasoning Translation via Long Chain-of-Thought ☆211 Updated 3 months ago
- RealSI: Open Benchmark for Simultaneous Interpretation in Real-world Scenarios ☆53 Updated 4 months ago
- Unsloth Fine-tuning Notebooks for Google Colab, Kaggle, Hugging Face and more. ☆105 Updated this week
- 💡 VideoMind: A Chain-of-LoRA Agent for Long Video Reasoning ☆37 Updated this week