kyutai-labs / moshivisLinks

Kyutai with an "eye"

☆212

Alternatives and similar repositories for moshivis

Users that are interested in moshivis are comparing it to the libraries listed below

Sorting:

playht / PlayDiffusion
☆510Updated last month
jasonppy / VoiceStar
VoiceStar: Robust, Duration-controllable TTS that can Extrapolate
☆269Updated 2 months ago
AlexBodner / How_Much_VRAM
☆102Updated 11 months ago
mbzuai-oryx / LLMVoX
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
☆268Updated 2 months ago
MiniMax-AI / MiniMax-AI.github.io
The official GitHub Page for MiniMax
☆49Updated last month
menloresearch / ReZero
☆155Updated 3 months ago
google-deepmind / videoprism
Official repository for "VideoPrism: A Foundational Visual Encoder for Video Understanding" (ICML 2024)
☆262Updated this week
freddyaboulton / orpheus-cpp
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆302Updated 3 months ago
fluxions-ai / vui
☆628Updated last week
kyutai-labs / moshi-finetune
☆269Updated 3 weeks ago
maitrix-org / Voila
☆429Updated 3 months ago
MYZY-AI / Muyan-TTS
☆447Updated 2 months ago
slp-rl / slamkit
SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on On…
☆215Updated 2 months ago
randombk / chatterbox-vllm
VLLM Port of the Chatterbox TTS model
☆156Updated this week
ritabratamaiti / AnyModal
AnyModal is a Flexible Multimodal Language Model Framework for PyTorch
☆101Updated 7 months ago
hlt-mt / mosel
Collection of Open Source Speech Data
☆159Updated 8 months ago
ictnlp / LLaMA-Omni2
☆207Updated 2 months ago
tdrussell / qlora-pipe
A pipeline parallel training script for LLMs.
☆153Updated 3 months ago
nahidalam / maya
Maya: An Instruction Finetuned Multilingual Multimodal Model using Aya
☆117Updated 2 weeks ago
huggingface / huggingface-gemma-recipes
Inference, Fine Tuning and many more recipes with Gemma family of models
☆262Updated 2 weeks ago
ArturTanona / grpo_unsloth_docker
☆57Updated 5 months ago
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆88Updated 3 months ago
chentuochao / Spatial-Speech-Translation
The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"
☆65Updated 2 months ago
Finity-Alpha / OpenVoiceChat
Have a natural voice conversation with an LLM
☆252Updated 7 months ago
microsoft / GRIN-MoE
GRadient-INformed MoE
☆264Updated 10 months ago
nivibilla / build-nanogpt
Video+code lecture on building nanoGPT from scratch
☆69Updated last year
AK391 / gemini-gradio
☆95Updated 7 months ago
modelscope / modelscope-studio
A third-party component library based on Gradio.
☆110Updated last week
MetaStone-AI / XBai-o4
☆207Updated this week
bklieger-groq / gradio-groq-basics
Building Blocks for Multi-Modal Gradio Powered by Groq Apps
☆112Updated 9 months ago