davidbrowne17/csm-streaming

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/davidbrowne17/csm-streaming)

davidbrowne17 / csm-streaming

Realtime demo, Streaming and Finetuning code for CSM

☆456

Alternatives and similar repositories for csm-streaming

Users that are interested in csm-streaming are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

davidbrowne17 / csm-streaming-tf
View on GitHub
A transformers implementation of csm-streaming
☆30May 16, 2025Updated last year
thomasgauthier / csm-hf
View on GitHub
Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers
☆58May 17, 2025Updated last year
knottwill / sesame-finetune
View on GitHub
Finetune Sesame AI's conversational speech model on new languages and voices. Blog post: https://blog.speechmatics.com/sesame-finetune
☆113Sep 27, 2025Updated 9 months ago
ReisCook / VoiceAssistant
View on GitHub
A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM
☆81May 19, 2025Updated last year
phildougherty / sesame_csm_openai
View on GitHub
OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT
☆437Sep 26, 2025Updated 10 months ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
davidbrowne17 / Mimi-Voice
View on GitHub
Create Unmute voice embeddings
☆26Nov 15, 2025Updated 8 months ago
davidbrowne17 / chatterbox-streaming
View on GitHub
Streaming and Fine-tuning for Chatterbox TTS
☆292Jun 15, 2025Updated last year
Cross-Product-Labs / csm_finetune
View on GitHub
Finetune Sesame's CSM 1B model, for fun and profit
☆17Mar 24, 2025Updated last year
isaiahbjork / csm-voice-cloning
View on GitHub
Sesame CSM 1B Voice Cloning
☆339Mar 15, 2025Updated last year
SesameAILabs / csm
View on GitHub
A Conversational Speech Generation Model
☆14,696May 27, 2025Updated last year
senstella / csm-mlx
View on GitHub
An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.
☆408Aug 15, 2025Updated 11 months ago
mahimairaja / awesome-csm-1b
View on GitHub
List of curated use cases built using Sesame's CSM 1B
☆74May 29, 2025Updated last year
ysharma3501 / FastMaya
View on GitHub
A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.
☆66Nov 17, 2025Updated 8 months ago
Saganaki22 / CSM-WebUI
View on GitHub
Win & Liunux Gradio WebUI for CSM-1B model by sesame
☆52Mar 17, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
akashjss / sesame-csm
View on GitHub
A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.
☆214May 9, 2025Updated last year
nytopop / illu
View on GitHub
realtime conversational dynamics
☆19Mar 19, 2025Updated last year
Lex-au / Orpheus-FastAPI
View on GitHub
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆717Jul 5, 2025Updated last year
nytopop / csm
View on GitHub
A Conversational Speech Generation Model
☆14Mar 16, 2025Updated last year
Lex-au / Vocalis
View on GitHub
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…
☆309Apr 14, 2025Updated last year
taresh18 / TTSizer
View on GitHub
🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets
☆142Aug 10, 2025Updated 11 months ago
yangdongchao / RSTnet
View on GitHub
Real-time Speech-Text Foundation Model Toolkit (wip)
☆255Mar 26, 2025Updated last year
zenforic / csm-multi
View on GitHub
Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…
☆26Mar 28, 2025Updated last year
timonharz / Orpheus-FastAPI
View on GitHub
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆25May 16, 2025Updated last year
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
canopyai / Orpheus-TTS
View on GitHub
Towards Human-Sounding Speech
☆6,262Dec 5, 2025Updated 7 months ago
kyutai-labs / moshi-finetune
View on GitHub
☆474Oct 3, 2025Updated 9 months ago
randombk / chatterbox-vllm
View on GitHub
VLLM Port of the Chatterbox TTS model
☆379Oct 18, 2025Updated 9 months ago
ruapotato / csm-buddy
View on GitHub
Playing with CSM
☆22Mar 14, 2025Updated last year
asiff00 / On-Device-Speech-to-Speech-Conversational-AI
View on GitHub
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…
☆255Nov 24, 2025Updated 8 months ago
taresh18 / conversify
View on GitHub
🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs
☆111Jun 25, 2025Updated last year
mbzuai-oryx / LLMVoX
View on GitHub
LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM
☆308May 16, 2025Updated last year
jazir555 / SesameConverse
View on GitHub
Sesame Converse - Real Time Conversations - Powered by Gemma 3
☆64Mar 19, 2025Updated last year
voicepowered-ai / VibeVoice-finetuning
View on GitHub
Unofficial WIP LoRa Finetuning repository for VibeVoice
☆370Sep 24, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
Haadesx / realtime-voice-csm
View on GitHub
Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…
☆17Mar 18, 2025Updated last year
rsxdalv / chatterbox
View on GitHub
SoTA open-source TTS
☆165Dec 16, 2025Updated 7 months ago
yuriak / SpeechDialogueFactory
View on GitHub
☆40Apr 3, 2025Updated last year
TheAjaykrishnanR / TaraSharp
View on GitHub
Orpheus-TTS local speech synthesizer written entirely in C#
☆31Nov 25, 2025Updated 8 months ago
duerig / StyleTTS2
View on GitHub
StyleTTS 2 Optimized Training Fork
☆32Feb 2, 2025Updated last year
SebastianBodza / Orpheus_Distributed_FastAPI
View on GitHub
☆15Mar 30, 2026Updated 3 months ago
ysharma3501 / FastNeuTTS
View on GitHub
A highly optimized engine for neutts-air model to generate minutes of audio in seconds. Over 200x realtime on modern hardware!
☆118Nov 24, 2025Updated 8 months ago