mahimairaja/awesome-csm-1b

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/mahimairaja/awesome-csm-1b)

mahimairaja / awesome-csm-1b

List of curated use cases built using Sesame's CSM 1B

☆74

Alternatives and similar repositories for awesome-csm-1b

Users that are interested in awesome-csm-1b are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

isaiahbjork / csm-voice-cloning
View on GitHub
Sesame CSM 1B Voice Cloning
☆339Mar 15, 2025Updated last year
Cross-Product-Labs / csm_finetune
View on GitHub
Finetune Sesame's CSM 1B model, for fun and profit
☆17Mar 24, 2025Updated last year
phildougherty / sesame_csm_openai
View on GitHub
OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT
☆437Sep 26, 2025Updated 10 months ago
Haadesx / realtime-voice-csm
View on GitHub
Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…
☆17Mar 18, 2025Updated last year
nytopop / illu
View on GitHub
realtime conversational dynamics
☆19Mar 19, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
thomasgauthier / csm-hf
View on GitHub
Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers
☆58May 17, 2025Updated last year
asiff00 / On-Device-Speech-to-Speech-Conversational-AI
View on GitHub
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…
☆255Nov 24, 2025Updated 8 months ago
davidbrowne17 / csm-streaming
View on GitHub
Realtime demo, Streaming and Finetuning code for CSM
☆456Sep 17, 2025Updated 10 months ago
PkmX / orpheus-chat-webui
View on GitHub
Orpheus Chat WebUI
☆76Mar 27, 2025Updated last year
ReisCook / VoiceAssistant
View on GitHub
A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM
☆81May 19, 2025Updated last year
aymanelotfi / monika
View on GitHub
Monika is an AI assistant that combines speech-to-text, natural language processing, and text-to-speech capabilities for seamless interac…
☆27Mar 31, 2025Updated last year
smartaces / dia_podcast_generator
View on GitHub
☆54May 28, 2025Updated last year
EndlessReform / csm_mlx
View on GitHub
☆21Apr 6, 2025Updated last year
steinathan / telephony-server
View on GitHub
Telephony Server is a powerful bridge that connects telephony providers (Twilio, Vonage, Plivo, etc.) with real-time communication platfo…
☆23Feb 2, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
Saganaki22 / CSM-WebUI
View on GitHub
Win & Liunux Gradio WebUI for CSM-1B model by sesame
☆52Mar 17, 2025Updated last year
senstella / csm-mlx
View on GitHub
An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.
☆408Aug 15, 2025Updated 11 months ago
KillerShoaib / DeepSeek-r1-Bangla-Reasoning-Data
View on GitHub
This is a side project where me and my friend try to generate synthetic data in bangla from deepseek-r1. So that can be used for model di…
☆11Jun 28, 2025Updated last year
asiff00 / Training-TTS
View on GitHub
Train and finutune text-to-speech models for Bengali and many other languages!
☆18Apr 2, 2025Updated last year
knottwill / sesame-finetune
View on GitHub
Finetune Sesame AI's conversational speech model on new languages and voices. Blog post: https://blog.speechmatics.com/sesame-finetune
☆113Sep 27, 2025Updated 10 months ago
isaiahbjork / orpheus-tts-local
View on GitHub
Run Orpheus 3B Locally With LM Studio
☆546Mar 20, 2025Updated last year
taylorchu / 2cent-tts
View on GitHub
☆58Feb 8, 2026Updated 5 months ago
ruapotato / csm-buddy
View on GitHub
Playing with CSM
☆22Mar 14, 2025Updated last year
Unmortan-Ellary / Vascura-FRONT
View on GitHub
Bloat Free, Portable and Lightweight LLM Frontend (Single HTML file). With Lorebook, Web Search, Macro Engine etc.
☆22Jul 18, 2026Updated last week
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
dant2021 / a-research
View on GitHub
I publish my weekly research here
☆20Jun 26, 2025Updated last year
avijeett007 / kno2gether-livekit-playground
View on GitHub
Kno2gether Agent PlayGround
☆24Dec 16, 2024Updated last year
daily-co / pipecat-cloud-images
View on GitHub
Pipecat Cloud agent docker images and examples
☆17Updated this week
jeinselen / Blender-ProductionKit
View on GitHub
Production shortcuts and toolsets for Blender 4.2+
☆13Jul 10, 2026Updated 2 weeks ago
freddyaboulton / orpheus-cpp
View on GitHub
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆353Apr 10, 2025Updated last year
Lex-au / Orpheus-FastAPI
View on GitHub
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆716Jul 5, 2025Updated last year
asiff00 / Bangla-Llama
View on GitHub
Fine tuned llama 3 models for context based question answering in bengali language.
☆17Oct 14, 2024Updated last year
jwest33 / latent_control_adapters
View on GitHub
Multi-vector latent space steering adapter module for language models
☆20Nov 22, 2025Updated 8 months ago
zenforic / csm-multi
View on GitHub
Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…
☆26Mar 28, 2025Updated last year
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
amrrs / scrapegraph-code
View on GitHub
☆17May 9, 2024Updated 2 years ago
edwko / OuteTTS
View on GitHub
Interface for OuteTTS models.
☆1,436Mar 23, 2026Updated 4 months ago
mush42 / mantoq
View on GitHub
Arabic Grapheme-to-Phoneme (G2P) Conversion
☆16Mar 15, 2025Updated last year
SesameAILabs / csm
View on GitHub
A Conversational Speech Generation Model
☆14,699May 27, 2025Updated last year
The-Swarm-Corporation / AgentGym
View on GitHub
A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1
☆24Oct 13, 2025Updated 9 months ago
jazir555 / SesameConverse
View on GitHub
Sesame Converse - Real Time Conversations - Powered by Gemma 3
☆64Mar 19, 2025Updated last year
Lex-au / Vocalis
View on GitHub
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…
☆310Apr 14, 2025Updated last year