Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech Generation Model
☆26Mar 28, 2025Updated last year
Alternatives and similar repositories for csm-multi
Users that are interested in csm-multi are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆64Mar 19, 2025Updated last year
- ☆15Mar 18, 2026Updated 2 months ago
- Yet another frontend for LLM, written using .NET and WinUI 3☆11Sep 14, 2025Updated 8 months ago
- The hearth of The Pulsar App, fast, secure and shared inference with modern UI☆60Dec 1, 2024Updated last year
- A tool for humans to interact with a Chroma vector database☆16Apr 25, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆215May 9, 2025Updated last year
- Telegram > OpenAI > Read Later [instapaper/pocket/omnivore]☆16Jul 15, 2023Updated 2 years ago
- QLoRA: Efficient Finetuning of Quantized LLMs☆11Jul 22, 2023Updated 2 years ago
- JacQues is a Dash-based interactive web application that facilitates real-time chat and document management.☆23Jan 5, 2026Updated 4 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆58May 17, 2025Updated last year
- LLamaHTML is a simple html file to communicate with a running llamacpp llama-server☆24Aug 5, 2025Updated 9 months ago
- Generate Your Own Private Morning Radio for Commute☆32Feb 5, 2025Updated last year
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆80May 19, 2025Updated last year
- Your personal ArXiv Feed☆23Dec 18, 2024Updated last year
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Diffusion Pipe for Windows For ComfyUI☆28Jan 20, 2026Updated 4 months ago
- The purpose of this repository is to discuss on Audio transformers☆14Apr 16, 2026Updated last month
- The fastest Whisper optimization for automatic speech recognition as a command-line interface ⚡️☆10Dec 3, 2023Updated 2 years ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆435Sep 26, 2025Updated 7 months ago
- A reverse proxy manager written in go, to convert exposed ports into token-based auth protected ports☆20Apr 14, 2025Updated last year
- A car Heads Up Display built using a RGB LED strip and a Teensy microcontroller☆10Jul 5, 2017Updated 8 years ago
- VGDFR: Diffuison-based Video Generation with Dynamic Frame Rate☆18May 16, 2025Updated last year
- Offline LLM chatbot with personalized memory — works on CPU with multi-session memory support.☆22Jan 10, 2026Updated 4 months ago
- Realtime demo, Streaming and Finetuning code for CSM☆455Sep 17, 2025Updated 8 months ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Summarizing with LLMs: Using an LLM to understand GitHub issues without reading each post in detail.☆15Jul 22, 2024Updated last year
- 33B Chinese LLM, DPO QLORA, 100K context, AirLLM 70B inference with single 4GB GPU☆13May 5, 2024Updated 2 years ago
- A 3D simulator for OFS made in godot. Forked.☆12Sep 18, 2024Updated last year
- 🍨 Gelato — From Data Curation to Reinforcement Learning: Building a Strong Grounding Model for Computer-Use Agents☆46Dec 22, 2025Updated 5 months ago
- An example FastAPI server that streams messages from Autogen using OpenAI API format☆15Jul 3, 2024Updated last year
- ☆65Jun 24, 2025Updated 11 months ago
- a character-ai like UI for LLM☆10Dec 3, 2024Updated last year
- Create Unmute voice embeddings☆25Nov 15, 2025Updated 6 months ago
- A highly optimized engine for maya-1 tts model to generate minutes of audio in seconds.☆66Nov 17, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- ☆19Dec 31, 2024Updated last year
- LLMProxy is an intelligent large language model backend routing proxy service.☆25Dec 6, 2025Updated 5 months ago
- Smart proxy for LLM APIs that enables model-specific parameter control, automatic mode switching (like Qwen3's /think and /no_think), and…☆51May 19, 2025Updated last year
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆31May 1, 2025Updated last year
- Bitmap fonts extracted from ZX Spectrum games☆24Nov 8, 2022Updated 3 years ago
- Simple playground chat app that interacts with OpenAI's functions with memory and custom tools.☆17Jul 11, 2023Updated 2 years ago
- "Pacha" TUI (Text User Interface) is a JavaScript application that utilizes the "blessed" library. It serves as a frontend for llama.cpp …☆37Aug 3, 2023Updated 2 years ago