List of curated use cases built using Sesame's CSM 1B
☆72May 29, 2025Updated 9 months ago
Alternatives and similar repositories for awesome-csm-1b
Users that are interested in awesome-csm-1b are comparing it to the libraries listed below
- Sesame CSM 1B Voice Cloning☆332Mar 15, 2025Updated 11 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57May 17, 2025Updated 9 months ago
- Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…☆18Mar 18, 2025Updated 11 months ago
- realtime conversational dynamics☆19Mar 19, 2025Updated 11 months ago
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆241Nov 24, 2025Updated 3 months ago
- Realtime demo, Streaming and Finetuning code for CSM☆444Sep 17, 2025Updated 5 months ago
- Orpheus Chat WebUI☆75Mar 27, 2025Updated 11 months ago
- A side project in which a friend and I try to generate synthetic data in Bangla from deepseek-r1, so that it can be used for model di…☆11Jun 28, 2025Updated 8 months ago
- Train and fine-tune text-to-speech models for Bengali and many other languages!☆18Apr 2, 2025Updated 11 months ago
- An implementation of the CSM (Conversational Speech Model) for Apple Silicon using MLX.☆397Aug 15, 2025Updated 6 months ago
- A transformers implementation of csm-streaming☆27May 16, 2025Updated 9 months ago
- FastRTC voice agent☆22Mar 18, 2025Updated 11 months ago
- ☆17May 9, 2024Updated last year
- Finetune Sesame AI's conversational speech model on new languages and voices. Blog post: https://blog.speechmatics.com/sesame-finetune☆102Sep 27, 2025Updated 5 months ago
- Run Orpheus 3B Locally With LM Studio☆519Mar 20, 2025Updated 11 months ago
- Real-time Speech-Text Foundation Model Toolkit (wip)☆254Mar 26, 2025Updated 11 months ago
- Generate audio datasets for training Text-To-Speech models, through smart audio splitting with silence detection, and transcription using…☆30May 27, 2023Updated 2 years ago
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆670Jul 5, 2025Updated 8 months ago
- Demo project for a horror game enemy that tries to find and catch the player using 3D navigation and a node-based finite state machine.☆22Dec 23, 2024Updated last year
- ☆21Apr 6, 2025Updated 11 months ago
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated last year
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆350Apr 10, 2025Updated 10 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆26Mar 28, 2025Updated 11 months ago
- B-Llama3o a llama3 with Vision Audio and Audio understanding as well as text and Audio and Animation Data output.☆26Jun 3, 2024Updated last year
- Streaming Vocos☆30Jun 10, 2025Updated 8 months ago
- Interface for OuteTTS models.☆1,427Jun 21, 2025Updated 8 months ago
- A Conversational Speech Generation Model☆14,530May 27, 2025Updated 9 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3