List of curated use cases built using Sesame's CSM 1B
☆72May 29, 2025Updated 10 months ago
Alternatives and similar repositories for awesome-csm-1b
Users that are interested in awesome-csm-1b are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Finetune Sesame's CSM 1B model, for fun and profit☆17Mar 24, 2025Updated last year
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆434Sep 26, 2025Updated 6 months ago
- Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…☆18Mar 18, 2025Updated last year
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆244Nov 24, 2025Updated 4 months ago
- realtime conversational dynamics☆19Mar 19, 2025Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57May 17, 2025Updated 10 months ago
- Orpheus Chat WebUI☆75Mar 27, 2025Updated last year
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆77May 19, 2025Updated 10 months ago
- Realtime demo, Streaming and Finetuning code for CSM☆448Sep 17, 2025Updated 6 months ago
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆52Mar 17, 2025Updated last year
- ☆21Apr 6, 2025Updated 11 months ago
- Telephony Server is a powerful bridge that connects telephony providers (Twilio, Vonage, Plivo, etc.) with real-time communication platfo…☆23Feb 2, 2025Updated last year
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆398Aug 15, 2025Updated 7 months ago
- FastRTC voice agent☆22Mar 18, 2025Updated last year
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- Arabic Grapheme-to-Phoneme (G2P) Conversion☆13Mar 15, 2025Updated last year
- This is a side project where me and my friend try to generate synthetic data in bangla from deepseek-r1. So that can be used for model di…☆11Jun 28, 2025Updated 9 months ago
- Train and finutune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated 11 months ago
- Run Orpheus 3B Locally With LM Studio☆527Mar 20, 2025Updated last year
- Playing with CSM☆22Mar 14, 2025Updated last year
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆350Apr 10, 2025Updated 11 months ago
- Kno2gether Agent PlayGround☆24Dec 16, 2024Updated last year
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆682Jul 5, 2025Updated 8 months ago
- Quantized text-audio foundation model from Boson AI☆43Aug 13, 2025Updated 7 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Fine tuned llama 3 models for context based question answering in bengali language.☆18Oct 14, 2024Updated last year
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated last year
- Bitcoin utilities and protocol library for interacting with the network☆15Oct 27, 2025Updated 5 months ago
- Interface for OuteTTS models.☆1,431Mar 23, 2026Updated last week
- A framework making it effortless to convert any llm model into a reasoning agent like o1 or DeepSeek's r1☆24Oct 13, 2025Updated 5 months ago
- A Conversational Speech Generation Model☆14,559May 27, 2025Updated 10 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Jun 3, 2024Updated last year
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆294Apr 14, 2025Updated 11 months ago
- A Model (maybe an app) that translates the audio of a video from one language to another language, cloning the voice of original video wi…☆16May 19, 2025Updated 10 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting with the flexibility to host WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Cloudways by DigitalOcean.
- ☆453Nov 2, 2025Updated 4 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆64Mar 19, 2025Updated last year
- Script to generate an html reports of installed software, installed updates and installed components on a remote computer☆11Mar 13, 2025Updated last year
- SGLang is a fast serving framework for large language models and vision language models.☆21May 22, 2025Updated 10 months ago
- ☆54May 28, 2025Updated 10 months ago
- A car Heads Up Display built using a RGB LED strip and a Teensy microcontroller☆10Jul 5, 2017Updated 8 years ago
- Create cheatsheets out of videos☆17Aug 7, 2025Updated 7 months ago