phildougherty / sesame_csm_openai

OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT

☆381

Alternatives and similar repositories for sesame_csm_openai

Users who are interested in sesame_csm_openai are comparing it to the repositories listed below.

davidbrowne17 / csm-streaming
Realtime demo, Streaming and Finetuning code for CSM
☆364 Updated 2 months ago
Lex-au / Orpheus-FastAPI
High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.
☆490 Updated last month
isaiahbjork / orpheus-tts-local
Run Orpheus 3B Locally With LM Studio
☆446 Updated 4 months ago
Lex-au / Vocalis
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…
☆200 Updated 3 months ago
isaiahbjork / csm-voice-cloning
Sesame CSM 1B Voice Cloning
☆319 Updated 4 months ago
devnen / Dia-TTS-Server
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…
☆304 Updated 2 months ago
akashjss / sesame-csm
A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.
☆194 Updated 2 months ago
mahimairaja / awesome-csm-1b
List of curated use cases built using Sesame's CSM 1B
☆69 Updated 2 months ago
tarun7r / Vocal-Agent
Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
☆105 Updated last week
jazir555 / SesameConverse
Sesame Converse - Real Time Conversations - Powered by Gemma 3
☆63 Updated 4 months ago
ValyrianTech / OpenVoice_server
API server for Instant voice cloning by MyShell.
☆98 Updated 10 months ago
amanvirparhar / weebo
A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.
☆233 Updated 6 months ago
kyutai-labs / unmute
Make text LLMs listen and speak
☆739 Updated this week
PkmX / orpheus-chat-webui
Orpheus Chat WebUI
☆71 Updated 4 months ago
davidbrowne17 / chatterbox-streaming
Streaming and Fine-tuning for Chatterbox TTS
☆143 Updated last month
nazdridoy / kokoro-tts
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…
☆610 Updated last week
senstella / csm-mlx
An implementation of the CSM (Conversational Speech Model) for Apple Silicon using MLX.
☆369 Updated 2 months ago
callbacked / os1
A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser
☆106Updated last month
ExoFi-Labs / OllamaGTTS
☆186 Updated 4 months ago
PierrunoYT / Kokoro-TTS-Local
A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…
☆204 Updated 2 weeks ago
edwko / OuteTTS
Interface for OuteTTS models.
☆1,346 Updated last month
Saganaki22 / OrpheusTTS-WebUI
Lightweight Gradio-based WebUI for orpheusTTS - WSL / Linux [CUDA]
☆101 Updated 4 months ago
asiff00 / On-Device-Speech-to-Speech-Conversational-AI
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…
☆157 Updated 3 months ago
KoljaB / LocalEmotionalAIVoiceChat
Simulates talk with an AI that can express emotions
☆77 Updated last month
astramind-ai / Auralis
A Fast TTS Engine
☆526 Updated 6 months ago
lucasnewman / f5-tts-mlx
Implementation of F5-TTS in MLX
☆567 Updated 4 months ago
SingularityMan / vector_companion
A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…
☆227 Updated last week
kaminoer / KokoDOS
Yet another self-hosted AI voice assistant. GlaDOS' blazing fast pipeline with Kokoro TTS voice and vision.
☆57 Updated 6 months ago
freddyaboulton / orpheus-cpp
Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)
☆302 Updated 3 months ago
matatonic / openedai-speech
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
☆797 Updated 6 months ago