mahimairaja / awesome-csm-1b
List of curated use cases built using Sesame's CSM 1B
☆62Updated last month
Alternatives and similar repositories for awesome-csm-1b:
Users who are interested in awesome-csm-1b are comparing it to the repositories listed below
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆61Updated last month
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆100Updated last week
- A lightweight recreation of OS1/Samantha from the movie Her, running locally in the browser☆70Updated this week
- Faster Whisper with additional features☆43Updated last month
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆277Updated last week
- A cutting-edge Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆58Updated 2 weeks ago
- ☆69Updated last month
- Orpheus Chat WebUI☆52Updated 3 weeks ago
- A Model Context Protocol (MCP) server for ATLAS, a Neo4j-powered task management system for LLM Agents - implementing a three-tier archit…☆128Updated 2 weeks ago
- PocketFlow's node-based workflow structure, with Manus' agents and tools!☆192Updated this week
- ☆130Updated last week
- A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.☆130Updated 10 months ago
- A comprehensive platform for managing, testing, and leveraging Ollama AI models with advanced features for customization, workflow automa…☆47Updated last month
- Autonomous debugging agent MCP server☆126Updated this week
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.☆43Updated 3 weeks ago
- Model Context Protocol server for Replicate's API☆52Updated last month
- API server for Instant voice cloning by MyShell.☆89Updated 7 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆294Updated this week
- Turn text from websites into spoken audio with edge-tts, F5, etc. and save as mp3 files☆46Updated last month
- Open source tool for transcription and subtitling, alternative to happyscribe.☆26Updated 2 months ago
- Excalidraw meets ComfyUI for LLMs☆250Updated 2 months ago
- ☆91Updated 3 months ago
- A Multi-modal MCP client for voice powered agentic workflows☆167Updated 2 months ago
- Chat Application Starter Kit — Gemini Multimodal Live API + Pipecat☆188Updated last month
- Realtime TTS reading of large text files by your favourite voice. +Translation via LLM (Python script)☆52Updated 6 months ago
- The agentic video editing framework☆116Updated 2 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 3 weeks ago
- Win & Linux Gradio WebUI for CSM-1B model by sesame☆40Updated last month
- Production-ready FastAPI wrapper for Zonos TTS models with GPU acceleration, voice cloning, and emotion control. Supports both Transforme…☆35Updated 2 months ago
- Voice AI agent starter kit with Groq, Llama 4, and (optionally) Twilio☆67Updated last week