isaiahbjork/csm-voice-cloning

Sesame CSM 1B Voice Cloning

☆339

Alternatives and similar repositories for csm-voice-cloning

Users who are interested in csm-voice-cloning are comparing it to the libraries listed below.

mahimairaja / awesome-csm-1b
List of curated use cases built using Sesame's CSM 1B
☆74 · May 29, 2025 · Updated last year
phildougherty / sesame_csm_openai
OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT
☆437 · Sep 26, 2025 · Updated 10 months ago
davidbrowne17 / csm-streaming
Realtime demo, Streaming and Finetuning code for CSM
☆456 · Sep 17, 2025 · Updated 10 months ago
zenoran / sesameai-tts
☆21 · Jul 23, 2025 · Updated last year
senstella / csm-mlx
An implementation of the CSM (Conversational Speech Model) for Apple Silicon using MLX.
☆408 · Aug 15, 2025 · Updated 11 months ago
SesameAILabs / csm
A Conversational Speech Generation Model
☆14,699 · May 27, 2025 · Updated last year
jazir555 / SesameConverse
Sesame Converse - Real Time Conversations - Powered by Gemma 3
☆64 · Mar 19, 2025 · Updated last year
knottwill / sesame-finetune
Finetune Sesame AI's conversational speech model on new languages and voices. Blog post: https://blog.speechmatics.com/sesame-finetune
☆113 · Sep 27, 2025 · Updated 10 months ago
ruapotato / csm-buddy
Playing with CSM
☆22 · Mar 14, 2025 · Updated last year
nytopop / csm
A Conversational Speech Generation Model
☆14 · Mar 16, 2025 · Updated last year
EndlessReform / csm_mlx
☆21 · Apr 6, 2025 · Updated last year
Haadesx / realtime-voice-csm
Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…
☆17 · Mar 18, 2025 · Updated last year
PasiKoodaa / dia
A TTS model capable of generating ultra-realistic dialogue in one pass.
☆32 · May 1, 2025 · Updated last year
KartDriver / mira_converse
☆83 · Feb 28, 2025 · Updated last year
davidbrowne17 / Mimi-Voice
Create Unmute voice embeddings
☆26 · Nov 15, 2025 · Updated 8 months ago
canopyai / Orpheus-TTS
Towards Human-Sounding Speech
☆6,264 · Dec 5, 2025 · Updated 7 months ago
lucasavila00 / LmScript
Controllable Language Model Interactions in TypeScript
☆10 · May 17, 2024 · Updated 2 years ago
thomasgauthier / csm-hf
Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers
☆58 · May 17, 2025 · Updated last year
remichu-ai / pai
Your personal and private AI
☆54 · Apr 3, 2025 · Updated last year
SebastianBodza / Orpheus_Distributed_FastAPI
☆15 · Mar 30, 2026 · Updated 3 months ago
thxxx / harper
☆19 · Nov 9, 2025 · Updated 8 months ago
remichu-ai / pai-agent
The accompanying backend for the PAI app
☆12 · Mar 24, 2025 · Updated last year
edwko / OuteTTS
Interface for OuteTTS models.
☆1,436 · Mar 23, 2026 · Updated 4 months ago
tarun7r / Vocal-Agent
Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
☆138 · Sep 7, 2025 · Updated 10 months ago
isaiahbjork / orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
☆546 · Mar 20, 2025 · Updated last year
jwest33 / latent_control_adapters
Multi-vector latent space steering adapter module for language models
☆20 · Nov 22, 2025 · Updated 8 months ago
andrewginns / CoreMLPlayer
Try CoreML models on multiple images and videos easily and quickly
☆41 · Nov 22, 2025 · Updated 8 months ago
Saganaki22 / CSM-WebUI
Windows & Linux Gradio WebUI for the CSM-1B model by Sesame
☆52 · Mar 17, 2025 · Updated last year
jaco-bro / diajax
Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control.
☆30 · May 7, 2025 · Updated last year
ExoFi-Labs / OllamaGTTS
☆202 · Mar 31, 2025 · Updated last year
ReisCook / VoiceAssistant
A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM
☆81 · May 19, 2025 · Updated last year
Nyarlth / higgs-audio_quantized
Quantized text-audio foundation model from Boson AI
☆43 · Aug 13, 2025 · Updated 11 months ago
electroglyph / quant_clone
Generate a llama-quantize command to copy the quantization parameters of any GGUF
☆35 · Apr 20, 2026 · Updated 3 months ago
Zyphra / Zonos
Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…
☆7,233 · Mar 5, 2025 · Updated last year
PkmX / orpheus-chat-webui
Orpheus Chat WebUI
☆76 · Mar 27, 2025 · Updated last year
nytopop / illu
realtime conversational dynamics
☆19 · Mar 19, 2025 · Updated last year
Lex-au / Orpheus-FastAPI
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆716 · Jul 5, 2025 · Updated last year
aymanelotfi / monika
Monika is an AI assistant that combines speech-to-text, natural language processing, and text-to-speech capabilities for seamless interac…
☆27 · Mar 31, 2025 · Updated last year
Lex-au / Vocalis
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…
☆310 · Apr 14, 2025 · Updated last year