devnen / Dia-TTS-Server
Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), support for SafeTensors/BF16, voice cloning, dialogue generation, and GPU/CPU execution.
☆48Updated this week
Alternatives and similar repositories for Dia-TTS-Server:
Users that are interested in Dia-TTS-Server are comparing it to the libraries listed below
- ☆79Updated 2 months ago
- ☆91Updated 3 months ago
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 6 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆100Updated last week
- A Discord bot for large language models. Add Gemini 2.5 Pro, Claude Sonnet 3.7, GPT 4.1, and other models. Easily change models, edit pro…☆81Updated this week
- API server for Instant voice cloning by MyShell.☆89Updated 7 months ago
- A lightweight UI for chatting with Ollama models. Streaming responses, conversation history, and multi-model support.☆107Updated last month
- OLLama IMage CAtegorizer☆66Updated 3 months ago
- A frontend for creative writing with LLMs☆123Updated 9 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆176Updated this week
- The PyVisionAI Official Repo☆101Updated last month
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 3 weeks ago
- Polyglot is a fast, elegant, and free translation tool using AI.☆60Updated 7 months ago
- Open source LLM UI, compatible with all local LLM providers.☆173Updated 7 months ago
- Open source tool for transcirption and subtitling, alternative to happyscribe.☆26Updated 2 months ago
- Convert Files / Folders / GitHub Repos Into AI / LLM-ready Files☆155Updated 2 months ago
- Something similar to Apple Intelligence?☆60Updated 9 months ago
- List of curated use cases built using Sesame's CSM 1B☆62Updated last month
- An API for VoiceCraft.☆25Updated 9 months ago
- ☆130Updated last week
- Orpheus Chat WebUI☆52Updated 3 weeks ago
- This is a technical writeup of the next evolution in the Adaptive Modular Network. It aims to unify the components of the AMN and fill ga…☆55Updated this week
- RetroChat is a powerful command-line interface for interacting with various AI language models. It provides a seamless experience for eng…☆73Updated 4 months ago
- ☆176Updated 3 weeks ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆34Updated last month
- ☆47Updated 2 months ago
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com.☆115Updated 11 months ago
- AI stack for interacting with LLMs, Stable Diffusion, Whisper, xTTS and many other AI models☆155Updated 11 months ago
- Open-source Perplexity app.☆120Updated last month
- Convert your PDFs and EPUBs into audiobooks effortlessly. Features intelligent text extraction, customizable text-to-speech settings, and…☆65Updated 3 weeks ago