ReisCook/VoiceAssistant

A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM

☆81

Alternatives and similar repositories for VoiceAssistant

Users who are interested in VoiceAssistant are comparing it to the libraries listed below.

davidbrowne17 / Mimi-Voice
Create Unmute voice embeddings
☆26 · Nov 15, 2025 · Updated 8 months ago
davidbrowne17 / csm-streaming
Realtime demo, Streaming and Finetuning code for CSM
☆455 · Sep 17, 2025 · Updated 10 months ago
zenforic / csm-multi
Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…
☆26 · Mar 28, 2025 · Updated last year
davidbrowne17 / csm-streaming-tf
A transformers implementation of csm-streaming
☆30 · May 16, 2025 · Updated last year
knottwill / sesame-finetune
Finetune Sesame AI's conversational speech model on new languages and voices. Blog post: https://blog.speechmatics.com/sesame-finetune
☆113 · Sep 27, 2025 · Updated 9 months ago
phildougherty / dia_openai
OpenAI compatible API for Dia-1.6B
☆35 · Apr 27, 2025 · Updated last year
mahimairaja / awesome-csm-1b
List of curated use cases built using Sesame's CSM 1B
☆74 · May 29, 2025 · Updated last year
callbacked / os1
A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser
☆138 · Jul 1, 2025 · Updated last year
Cross-Product-Labs / csm_finetune
Finetune Sesame's CSM 1B model, for fun and profit
☆17 · Mar 24, 2025 · Updated last year
prakharsr / Orpheus-TTS-FastAPI
A high-performance FastAPI-based server that provides OpenAI-compatible Text-to-Speech (TTS) endpoints using the Orpheus TTS https://gith…
☆31 · Nov 15, 2025 · Updated 8 months ago
asiff00 / On-Device-Speech-to-Speech-Conversational-AI
This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…
☆255 · Nov 24, 2025 · Updated 7 months ago
cartesia-one / csm.rs
Blazing-fast Rust implementation of Sesame's Conversational Speech Model (CSM)
☆85 · Mar 26, 2026 · Updated 3 months ago
hololeo / click-n-ollamarun
Bookmarklet to pull and run Hugging Face GGUF models in Ollama
☆18 · Oct 17, 2024 · Updated last year
nytopop / csm
A Conversational Speech Generation Model
☆14 · Mar 16, 2025 · Updated last year
taresh18 / conversify
🗣️ Real‑time, low‑latency voice, vision, and conversational‑memory AI assistant built on LiveKit and local LLMs
☆111 · Jun 25, 2025 · Updated last year
loserbcc / open-unified-tts
OpenAI-compatible TTS API that unifies multiple backends with smart chunking for unlimited-length generation
☆50 · May 5, 2026 · Updated 2 months ago
phildougherty / sesame_csm_openai
OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT
☆437 · Sep 26, 2025 · Updated 9 months ago
smartaces / dia_podcast_generator
☆54 · May 28, 2025 · Updated last year
CommanderZed / Physiclaw
Specialized AI agents for your bare metal. 100% On-Prem & Air-Gap ready.
☆24 · Feb 21, 2026 · Updated 5 months ago
thad0ctor / KrunchWrapper
☆18 · Jul 1, 2025 · Updated last year
kyutai-labs / unmute
Make text LLMs listen and speak
☆1,366 · Updated this week
thad0ctor / llama-server-launcher
Llama Server Launcher (llama.cpp/ik_llama) GUI
☆123 · Updated this week
calmstate / VisualTagger
Visual Tagger is a JavaScript tool that visually highlights HTML elements for AIs, aiding in identifying interactive components on web pa…
☆12 · Oct 28, 2024 · Updated last year
ealkanat / comfyui-easy-padding
☆19 · Dec 31, 2024 · Updated last year
akashjss / sesame-csm
A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.
☆214 · May 9, 2025 · Updated last year
carmelosantana / alpaca-bot
Chat with Gemma, Llama2, Mistral and more. Automate content creation with custom assistants and AI agents.
☆13 · Jan 30, 2025 · Updated last year
KevinAHM / echo-tts-api
Echo-TTS OpenAI Compatible Speech Endpoint w/ Streaming
☆29 · Apr 5, 2026 · Updated 3 months ago
PioneerMNDR / MousyHub
Web application for roleplaying with AI-powered characters
☆67 · Jul 8, 2025 · Updated last year
senstella / csm-mlx
An implementation of the CSM (Conversational Speech Model) for Apple Silicon using MLX.
☆406 · Aug 15, 2025 · Updated 11 months ago
jazir555 / SesameConverse
Sesame Converse - Real Time Conversations - Powered by Gemma 3
☆64 · Mar 19, 2025 · Updated last year
ReisCook / Voice_Extractor
Automated speech dataset creator
☆224 · Jun 12, 2025 · Updated last year
ncoder-ai / VibeVoice-FastAPI
FastAPI wrapper around original Vibevoice 1.5B and 7B models, with support for AWQ4 quant
☆33 · Jun 22, 2026 · Updated last month
LAION-AI / Vocalino-V0.1-Voice-Acting-Pipeline
Open-weights voice acting pipeline combining zero-shot voice cloning with natural-language direction. Provide a reference voice (or gener…
☆16 · May 25, 2026 · Updated last month
uwzis / Wordpress-Chatbot-Openwebui
Open-source WordPress chatbot plugin. (UI in development, help wanted.)
☆18 · Feb 15, 2025 · Updated last year
Etherll / Timbre
Extract a target speaker’s clean, non-overlapped speech from multi-speaker audio and export word-safe LJSpeech-style TTS datasets.
☆21 · Jun 14, 2026 · Updated last month
wchisasa / rabbit
A fully autonomous agent that accesses the browser and performs tasks.
☆18 · Apr 25, 2025 · Updated last year
mikjee / warpdrv
Local LLM Server Manager + llama.cpp + Chat
☆17 · Updated this week
shubhdotai / offline-voice-ai
FastAPI + MLX offline-first voice agent with <1s latency. Minimal UI
☆56 · Oct 21, 2025 · Updated 9 months ago
jeremieLouvaert / ComfyUI-Darkroom
Professional color grading & film emulation suite for ComfyUI. 161 film stocks with real Capture One curve data. Physics-based H&D curves…
☆87 · Updated this week