akashjss/sesame-csm

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/akashjss/sesame-csm)

akashjss / sesame-csm

A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.

☆214

Alternatives and similar repositories for sesame-csm

Users that are interested in sesame-csm are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

zenforic / csm-multi
View on GitHub
Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…
☆26Mar 28, 2025Updated last year
davidbrowne17 / csm-streaming
View on GitHub
Realtime demo, Streaming and Finetuning code for CSM
☆456Sep 17, 2025Updated 10 months ago
PkmX / orpheus-chat-webui
View on GitHub
Orpheus Chat WebUI
☆76Mar 27, 2025Updated last year
fidecastro / llama-cpp-connector
View on GitHub
Super simple python connectors for llama.cpp, including vision models (Gemma 3, Qwen2-VL). Compile llama.cpp and run!
☆31Dec 11, 2025Updated 7 months ago
phildougherty / sesame_csm_openai
View on GitHub
OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT
☆437Sep 26, 2025Updated 10 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
jazir555 / SesameConverse
View on GitHub
Sesame Converse - Real Time Conversations - Powered by Gemma 3
☆64Mar 19, 2025Updated last year
akashjss / orpheus-tts-local-webui
View on GitHub
Run Orpheus 3B Locally with Gradio UI, Standalone App
☆25Apr 1, 2025Updated last year
dynamiccreator / voice-text-reader
View on GitHub
Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)
☆51Oct 18, 2024Updated last year
phildougherty / qwen2.5_omni_chat
View on GitHub
Service for testing out the new Qwen2.5 omni model
☆62Apr 30, 2025Updated last year
fishiatee / Tumera
View on GitHub
Yet another frontend for LLM, written using .NET and WinUI 3
☆11Sep 14, 2025Updated 10 months ago
SesameAILabs / csm
View on GitHub
A Conversational Speech Generation Model
☆14,696May 27, 2025Updated last year
Lex-au / Vocalis
View on GitHub
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…
☆310Apr 14, 2025Updated last year
matteoserva / GraphLLM
View on GitHub
☆212Jan 5, 2026Updated 6 months ago
masterFoad / NanoSage
View on GitHub
Local LLM Powered Recursive Search & Smart Knowledge Explorer
☆267May 13, 2026Updated 2 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
avarayr / suaveui
View on GitHub
Open source LLM UI, compatible with all local LLM providers.
☆177Sep 20, 2024Updated last year
ReisCook / VoiceAssistant
View on GitHub
A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM
☆81May 19, 2025Updated last year
isaiahbjork / orpheus-tts-local
View on GitHub
Run Orpheus 3B Locally With LM Studio
☆546Mar 20, 2025Updated last year
hasaranga / NativeChat
View on GitHub
win32 native frontend for llama-cli
☆14Nov 2, 2024Updated last year
smartaces / dia_podcast_generator
View on GitHub
☆54May 28, 2025Updated last year
thomasgauthier / csm-hf
View on GitHub
Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers
☆58May 17, 2025Updated last year
Toy-97 / Chat-WebUI
View on GitHub
Chat WebUI is an easy-to-use user interface for interacting with AI, and it comes with multiple useful built-in tools such as web search …
☆52Feb 10, 2026Updated 5 months ago
duynt575 / kokoro-voice-composer-backup
View on GitHub
Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.
☆55Mar 11, 2025Updated last year
remichu-ai / pai
View on GitHub
Your personal and private AI
☆54Apr 3, 2025Updated last year
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
prateekvellala / retrieval-experiments
View on GitHub
Exploring retrieval systems for language models
☆14Apr 12, 2025Updated last year
Azzedde / aiva_mock_interviews
View on GitHub
AIVA (AI Virtual Assistant) Mock Interviews is an interactive platform that simulates real interview scenarios using AI-generated questio…
☆72Oct 4, 2025Updated 9 months ago
NebuLlamaUI / NebuLlamaUI
View on GitHub
An interface that features barely zero external dependencies beyond the Ollama API itself, making it lightweight and portable to easily i…
☆12Mar 25, 2025Updated last year
Mahrkeenerh / lfind
View on GitHub
A natural language file search tool that uses LLMs to help you find files by describing what you're looking for.
☆28Mar 8, 2025Updated last year
syv-ai / PybberLink
View on GitHub
☆13Mar 10, 2025Updated last year
loserbcc / open-unified-tts
View on GitHub
OpenAI-compatible TTS API that unifies multiple backends with smart chunking for unlimited-length generation
☆50May 5, 2026Updated 2 months ago
autollama / autollama
View on GitHub
Anthropic's Contextual Retrieval implementation with visual chunk comparison. Preview context enrichment before/after embedding.
☆30Sep 25, 2025Updated 10 months ago
YofarDev / yofardev_ai
View on GitHub
☆47Apr 29, 2026Updated 3 months ago
mehtabmahir / easy-whisper-ui
View on GitHub
Easy to use interface for the Whisper model optimized for all GPUs!
☆556Feb 15, 2026Updated 5 months ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
charmandercha / ArchiDoc
View on GitHub
☆16Dec 16, 2024Updated last year
ILikeAI / AlwaysReddy
View on GitHub
AlwaysReddy is a LLM voice assistant that is always just a hotkey away.
☆757Mar 4, 2025Updated last year
edwko / OuteTTS
View on GitHub
Interface for OuteTTS models.
☆1,436Mar 23, 2026Updated 4 months ago
cocktailpeanutlabs / storydiffusion-comics
View on GitHub
☆16Apr 3, 2025Updated last year
calmstate / VisualTagger
View on GitHub
Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…
☆12Oct 28, 2024Updated last year
ExoFi-Labs / OllamaGTTS
View on GitHub
☆202Mar 31, 2025Updated last year
astramind-ai / Pulsar
View on GitHub
The hearth of The Pulsar App, fast, secure and shared inference with modern UI
☆58Dec 1, 2024Updated last year