isaiahbjork / csm-voice-cloningView external linksLinks
Sesame CSM 1B Voice Cloning
☆331Mar 15, 2025Updated 11 months ago
Alternatives and similar repositories for csm-voice-cloning
Users that are interested in csm-voice-cloning are comparing it to the libraries listed below
Sorting:
- List of curated use cases built using Sesame's CSM 1B☆72May 29, 2025Updated 8 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆431Sep 26, 2025Updated 4 months ago
- Generate a llama-quantize command to copy the quantization parameters of any GGUF☆30Jan 23, 2026Updated 3 weeks ago
- Realtime demo, Streaming and Finetuning code for CSM☆443Sep 17, 2025Updated 5 months ago
- ☆21Jul 23, 2025Updated 6 months ago
- An implementation of the CSM(Conversation Speech Model) for Apple Silicon using MLX.☆395Aug 15, 2025Updated 6 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆64Mar 19, 2025Updated 10 months ago
- A Conversational Speech Generation Model☆14,491May 27, 2025Updated 8 months ago
- Quantized text-audio foundation model from Boson AI☆43Aug 13, 2025Updated 6 months ago
- Playing with CSM☆22Mar 14, 2025Updated 11 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated 9 months ago
- Controllable Language Model Interactions in TypeScript☆10May 17, 2024Updated last year
- A Conversational Speech Generation Model☆14Mar 16, 2025Updated 11 months ago
- ☆83Feb 28, 2025Updated 11 months ago
- Real-time voice conversation system with Sesame CSM, featuring web-based audio visualization and GPU acceleration. Educational implementa…☆18Mar 18, 2025Updated 10 months ago
- ☆19Jul 4, 2025Updated 7 months ago
- Towards Human-Sounding Speech☆5,944Dec 5, 2025Updated 2 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆57May 17, 2025Updated 9 months ago
- An fully autonomous agent that accesses the browser and performs tasks.☆17Apr 25, 2025Updated 9 months ago
- A local-first LLM development studio. Build, test, and customize inference workflows with your own models — no cloud, totally local.☆17May 21, 2025Updated 8 months ago
- Running Microsoft's BitNet inference framework via FastAPI, Uvicorn and Docker.☆36Jul 2, 2025Updated 7 months ago
- ☆13Apr 25, 2025Updated 9 months ago
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆78May 19, 2025Updated 8 months ago
- Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.☆129Sep 7, 2025Updated 5 months ago
- Your personal and private AI☆55Apr 3, 2025Updated 10 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆285Apr 14, 2025Updated 10 months ago
- Interface for OuteTTS models.☆1,424Jun 21, 2025Updated 7 months ago
- AI Search engine☆13Sep 24, 2025Updated 4 months ago
- Open source static analysis toolkit for LLM agent plans☆13Aug 9, 2025Updated 6 months ago
- Scripts and tools for optimizing quantizations in llama.cpp with GGUF imatrices.☆18Jan 10, 2025Updated last year
- Run Orpheus 3B Locally With LM Studio☆515Mar 20, 2025Updated 10 months ago
- ☆21Apr 6, 2025Updated 10 months ago
- ☆201Mar 31, 2025Updated 10 months ago
- ☆134Dec 11, 2025Updated 2 months ago
- Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expres…☆7,160Mar 5, 2025Updated 11 months ago
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control.☆30May 7, 2025Updated 9 months ago
- An AI Vision Language Model System for extracting structured knowledge graph information(JSON) from images of process diagrams☆41Apr 5, 2025Updated 10 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆212May 9, 2025Updated 9 months ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆30Jun 9, 2025Updated 8 months ago