Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and GPU/CPU execution.
☆343 · May 31, 2025 · Updated 9 months ago
Alternatives and similar repositories for Dia-TTS-Server
Users who are interested in Dia-TTS-Server are comparing it to the repositories listed below.
- OpenAI compatible API for Dia-1.6B ☆36 · Apr 27, 2025 · Updated 10 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆17 · Jun 28, 2025 · Updated 8 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆220 · Feb 19, 2026 · Updated last week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching ☆4,484 · Jan 4, 2026 · Updated last month
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),… ☆1,053 · Feb 12, 2026 · Updated 2 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆19,135 · Nov 19, 2025 · Updated 3 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs. ☆663 · Jul 5, 2025 · Updated 7 months ago
- A functioning Sesame CSM project with a desktop GUI - Real-time factor: 0.6x with 4070 Ti Super - Requires only 8GB VRAM ☆78 · May 19, 2025 · Updated 9 months ago
- HTML parsing and searching tool ☆20 · Sep 27, 2025 · Updated 5 months ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv… ☆2,259 · Jan 9, 2026 · Updated last month
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆128 · Jul 25, 2025 · Updated 7 months ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. … ☆37 · Jun 12, 2024 · Updated last year
- 🤖 nGPT - A lightning-fast CLI tool that brings any OpenAI-compatible LLM (OpenAI, Ollama, Groq, Claude, Gemini) directly to your termina… ☆42 · Feb 20, 2026 · Updated last week
- ☆33 · Feb 19, 2026 · Updated last week
- Run Orpheus 3B Locally With LM Studio ☆517 · Mar 20, 2025 · Updated 11 months ago
- SoTA open-source TTS ☆136 · Jun 7, 2025 · Updated 8 months ago
- An open source deep research clone. AI Agent (Local LLM or Gemini) that reasons large amounts of web data extracted with SwiftSoup. ☆13 · Feb 10, 2025 · Updated last year
- Towards Human-Sounding Speech ☆5,968 · Dec 5, 2025 · Updated 2 months ago
- ComfyUI Dia text to speech ☆14 · May 29, 2025 · Updated 9 months ago
- Collection of Python Scripts that Allow Open Web UI to Interact with External APIs ☆12 · Apr 4, 2025 · Updated 10 months ago
- ☆11 · Aug 1, 2024 · Updated last year
- Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023) ☆12 · May 26, 2024 · Updated last year
- 🌟 Full-stack app for real-time avatar streaming with HeyGen & Gemini AI. Built with React, TypeScript, Express, and Tailwind during a ha… ☆17 · Feb 20, 2026 · Updated last week
- A simple library for loading word2vec binary model. ☆12 · Sep 17, 2015 · Updated 10 years ago
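- LTX-Video-Trainer-GUI is a GUI tool for training LTX video LoRA models; it supports training LoRA models for video generation through a simple interface. The trainer provides an intuitive GUI that lets users easily configure and launch the training process without writing complex code. ☆13 · Jul 18, 2025 · Updated 7 months ago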
- ACE-Step: A Step Towards Music Generation Foundation Model ☆49 · May 20, 2025 · Updated 9 months ago
- Use a video and cut out portions of it without re-mounting the video inbetween. ☆16 · Sep 23, 2024 · Updated last year
- Speech Assessment API in FastAPI with HuggingFace 🤗 ☆13 · May 18, 2025 · Updated 9 months ago
- The jukebox AI code base with some additional files to make running locally on a machine easier ☆11 · Mar 27, 2024 · Updated last year
- example apps for inference.sh ☆20 · Updated this week
- A simple FastAPI Server to run XTTSv2 ☆573 · Jul 21, 2024 · Updated last year
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F… ☆289 · Apr 14, 2025 · Updated 10 months ago
- An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend. ☆852 · Feb 2, 2025 · Updated last year
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆349 · Apr 10, 2025 · Updated 10 months ago
- Dia-JAX: A JAX port of Dia, the text-to-speech model for generating realistic dialogue from text with emotion and tone control. ☆30 · May 7, 2025 · Updated 9 months ago
- ☆2,973 · Feb 24, 2026 · Updated last week
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,… ☆81 · Oct 3, 2024 · Updated last year
- ☆56 · Jun 20, 2025 · Updated 8 months ago
- ☆46 · Jun 20, 2025 · Updated 8 months ago