Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and GPU/CPU execution.
☆345May 31, 2025Updated 9 months ago
Alternatives and similar repositories for Dia-TTS-Server
Users that are interested in Dia-TTS-Server are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆17Jun 28, 2025Updated 8 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆223Feb 19, 2026Updated last month
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…☆1,106Feb 12, 2026Updated last month
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆4,591Jan 4, 2026Updated 2 months ago
- OpenAI compatible TTS for Sesame CSM:1b & dia:1.6b - Voice Cloning from File/YT☆434Sep 26, 2025Updated 6 months ago
- Open source password manager - Proton Pass • AdSecurely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆676Jul 5, 2025Updated 8 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆129Jul 25, 2025Updated 8 months ago
- A tool-call based memory system for SillyTavern☆30Dec 30, 2025Updated 2 months ago
- ☆24May 28, 2025Updated 9 months ago
- Collection of Python Scripts that Allow Open Web UI to Interact with External APIs☆13Apr 4, 2025Updated 11 months ago
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM☆77May 19, 2025Updated 10 months ago
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control.☆29May 7, 2025Updated 10 months ago
- audiobook GUI for chatterbox☆37Jul 26, 2025Updated 8 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆51May 20, 2025Updated 10 months ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- ☆33Updated this week
- ☆56Jun 20, 2025Updated 9 months ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆2,290Jan 9, 2026Updated 2 months ago
- The jukebox AI code base with some additional files to make running locally on a machine easier☆11Mar 27, 2024Updated 2 years ago
- Run Orpheus 3B Locally With LM Studio☆525Mar 20, 2025Updated last year
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆293Apr 14, 2025Updated 11 months ago
- Higgs Audio v2 WebUI + One click installer WIN x64☆20Jul 25, 2025Updated 8 months ago
- ☆3,081Updated this week
- A simple FastAPI Server to run XTTSv2☆577Jul 21, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Towards Human-Sounding Speech☆6,016Dec 5, 2025Updated 3 months ago
- 🌟 Full-stack app for real-time avatar streaming with HeyGen & Gemini AI. Built with React, TypeScript, Express, and Tailwind during a ha…☆17Feb 20, 2026Updated last month
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. …☆37Jun 12, 2024Updated last year
- Real-time webcam demo with SmolVLM(mlx-community/SmolVLM-Instruct-4bit) and MLX-VLM☆25Jun 12, 2025Updated 9 months ago
- ☆46Jun 20, 2025Updated 9 months ago
- Collection of tips for using textgen in various ways☆19Aug 30, 2024Updated last year
- Find the hidden meaning of LLMs☆40Nov 13, 2025Updated 4 months ago
- ☆54May 28, 2025Updated 9 months ago
- Local, OpenAI-compatible text-to-speech (TTS) API using Chatterbox, enabling users to generate voice cloned speech anywhere the OpenAI AP…☆555Dec 23, 2025Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Run Orpheus 3B Locally With LM Studio☆32Mar 20, 2025Updated last year
- Wyoming protocol server for Microsoft Azure text-to-speech☆25Mar 20, 2026Updated last week
- ☆453Nov 2, 2025Updated 4 months ago
- The most feature-complete local AI workstation. Multi-GPU inference, integrated Stable Diffusion + ADetailer, voice cloning, research-gra…☆57Feb 24, 2026Updated last month
- A specialized node for ComfyUI that enable advanced motion and animation capabilities for image as guider for video processing In Hunyuan…☆30Jan 14, 2025Updated last year
- Giving the power of LLM's to a MUD lib.☆186Nov 29, 2025Updated 3 months ago
- List of curated use cases built using Sesame's CSM 1B☆72May 29, 2025Updated 9 months ago