kyutai-labs / moshivis
Kyutai with an "eye"
☆188Updated 3 weeks ago
Alternatives and similar repositories for moshivis:
Users that are interested in moshivis are comparing it to the libraries listed below
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆104Updated 2 weeks ago
- Code release for "LLMs can see and hear without any training"☆239Updated 2 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆240Updated last month
- Googles NotebookLM but local☆202Updated this week
- ☆203Updated 3 weeks ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆516Updated 2 weeks ago
- ☆100Updated 7 months ago
- ☆219Updated last month
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer☆265Updated last week
- A pipeline parallel training script for LLMs.☆137Updated 3 weeks ago
- G2P☆218Updated last week
- ☆150Updated 2 months ago
- SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…☆204Updated 3 weeks ago
- ☆93Updated 4 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆160Updated this week
- ☆121Updated last week
- Collection of Open Source Speech Data☆153Updated 5 months ago
- LlamaVoice is a llama-based large voice generation model, providing inference and training ability.☆232Updated 7 months ago
- ☆57Updated 2 months ago
- PyTorch implementation of Audio Flamingo 2: An Audio-Language Model with Long-Audio Understanding and Expert Reasoning Abilities.☆459Updated last week
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆109Updated 5 months ago
- GRadient-INformed MoE☆261Updated 7 months ago
- Inference code for the paper "Spirit-LM Interleaved Spoken and Written Language Model".☆902Updated 5 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆710Updated last month
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆253Updated last month
- Video+code lecture on building nanoGPT from scratch☆65Updated 10 months ago
- A lightweight end-to-end text-to-speech model☆112Updated 2 months ago
- Daily Research Bot helps you stay on top of new AI-related research and updates. Currently supports: `huggingface.co/papers` and `hype.re…☆46Updated 5 months ago
- CodeScientist: An automated scientific discovery system for code-based experiments☆221Updated 3 weeks ago
- Speech To Speech: an effort for an open-sourced and modular GPT4-o☆53Updated 6 months ago