Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech Generation Model
☆26Mar 28, 2025Updated last year
Alternatives and similar repositories for csm-multi
Users that are interested in csm-multi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆64Mar 19, 2025Updated last year
- ☆15Mar 18, 2026Updated 3 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 9 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆59Dec 1, 2024Updated last year
- ☆16Feb 1, 2025Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- ☆12Mar 22, 2024Updated 2 years ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆214May 9, 2025Updated last year
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- Service for testing out the new Qwen2.5 omni model☆62Apr 30, 2025Updated last year
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆58May 17, 2025Updated last year
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆24Aug 5, 2025Updated 10 months ago
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆80May 19, 2025Updated last year
- Your personal ArXiv Feed☆23Dec 18, 2024Updated last year
- Diffusion Pipe for Windows For ComfyUI☆28Jan 20, 2026Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Get Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- The purpose of this repository is to discuss on Audio transformers☆14Apr 16, 2026Updated 2 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆437Sep 26, 2025Updated 9 months ago
- VGDFR: Diffuison-based Video Generation with Dynamic Frame Rate☆18May 16, 2025Updated last year
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated 5 months ago
- Realtime demo, Streaming and Finetuning code for CSM☆454Sep 17, 2025Updated 9 months ago
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated 2 years ago
- An example FastAPI server that streams messages from Autogen using OpenAI API format☆15Jul 3, 2024Updated 2 years ago
- 🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents☆46Dec 22, 2025Updated 6 months ago
- ☆65Jun 24, 2025Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- Open-source, modular cloud automation and billing system.☆17Jun 4, 2026Updated 3 weeks ago
- A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.☆66Nov 17, 2025Updated 7 months ago
- LLMProxy is an intelligent large language model backend routing proxy service.☆25Dec 6, 2025Updated 6 months ago
- ☆19Jun 17, 2025Updated last year
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆51May 19, 2025Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆32May 1, 2025Updated last year
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆17Jul 11, 2023Updated 2 years ago
- "Pacha" TUI (Text User Interface) is a JavaScript application that utilizes the "blessed" library. It serves as a frontend for llama.cpp …☆38Aug 3, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- ShellSpeak translates natural language to shell commands, simplifying system interactions for non-tech-savvy users. With color-coded UI, …☆12Nov 26, 2023Updated 2 years ago
- Personal voice assistant, with voice interruption and Twilio support☆18Feb 24, 2025Updated last year
- ChatGPT-rs is a lightweight ChatGPT client with a graphical user interface, written in Rust. It allows you to chat with OpenAI's GPT mode…☆13Apr 5, 2023Updated 3 years ago
- UpToDateAI, an open source tool to help you help AI assist you with coding and debugging in lesser-known or newly released programming fr…☆12Sep 10, 2024Updated last year
- Portrait Tools: Facial detection cropping, alignment, ID photo, etc☆21Jun 15, 2025Updated last year
- klmbr - a prompt pre-processing technique to break through the barrier of entropy while generating text with LLMs☆89Sep 22, 2024Updated last year
- Demo repository for creating a custom chatbot powered by LLMs for Telegram and Whatsapp.☆15Jan 18, 2024Updated 2 years ago