nytopop / csmLinks
A Conversational Speech Generation Model
☆14Updated 5 months ago
Alternatives and similar repositories for csm
Users that are interested in csm are comparing it to the libraries listed below
Sorting:
- Streaming and Fine-tuning for Chatterbox TTS☆171Updated 2 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57Updated 3 months ago
- Provide Gradio custom components to make the diarization-based audio labeling process easier and faster.☆68Updated last month
- Finetune Sesame AI's conversational speech model on new languages and voices. Blog post: https://blog.speechmatics.com/sesame-finetune☆70Updated 3 months ago
- ☆289Updated 2 months ago
- Realtime demo, Streaming and Finetuning code for CSM☆382Updated 3 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆63Updated 5 months ago
- ☆21Updated 5 months ago
- Automatically cleaning, enhancing, segmenting, filtering, and formatting a dataset to fine tune or train a voice model.☆42Updated last month
- Run Orpheus 3B Locally With LM Studio☆459Updated 5 months ago
- ☆99Updated last year
- ☆276Updated last month
- ☆251Updated 2 months ago
- ☆210Updated last week
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆121Updated 3 weeks ago
- ☆222Updated 3 months ago
- Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) in MLX☆27Updated 10 months ago
- Efficient approach to speaker diarization using voice characteristics extraction☆99Updated 2 months ago
- realtime conversational dynamics☆19Updated 5 months ago
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated last year
- Real-time processing and delivery of sentences from a continuous stream of characters or text chunks.☆68Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆120Updated last month
- Finetune Sesame's CSM 1B model, for fun and profit☆17Updated 5 months ago
- Simulates talk with an AI that can express emotions☆78Updated 2 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆272Updated 3 months ago
- Open TTS models, built for streaming on the edge☆42Updated 5 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆186Updated 4 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆184Updated 4 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆397Updated last month
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆101Updated 5 months ago