matatonic / openedai-speech
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
☆746Updated 2 months ago
Alternatives and similar repositories for openedai-speech:
Users that are interested in openedai-speech are comparing it to the libraries listed below
- ☆1,725Updated this week
- A simple FastAPI Server to run XTTSv2☆498Updated 9 months ago
- Run Orpheus 3B Locally With LM Studio☆367Updated last month
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆251Updated this week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching☆2,402Updated 2 weeks ago
- Webui for using XTTS and for finetuning it☆780Updated 3 months ago
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…☆371Updated last week
- What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain …☆649Updated 2 weeks ago
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…☆1,718Updated 2 weeks ago
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away.☆734Updated last month
- A Fast TTS Engine☆490Updated 2 months ago
- Interface for OuteTTS models.☆1,178Updated last week
- OpenAI compatible TTS for Sesame CSM:1b - Voice Cloning from File/YT☆287Updated 3 weeks ago
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs☆724Updated 3 weeks ago
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C…☆614Updated 8 months ago
- Model swapping for llama.cpp (or any local OpenAPI compatible server)☆544Updated last week
- plug whisper audio transcription to a local ollama server and ouput tts audio responses☆313Updated last year
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆338Updated 4 months ago
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices.☆167Updated 3 weeks ago
- Local LLM Powered Recursive Search & Smart Knowledge Explorer☆235Updated 2 months ago
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆160Updated 2 weeks ago
- Dockerfile for WhisperX: Automatic Speech Recognition with Word-Level Timestamps and Speaker Diarization (Dockerfile, CI image build and …☆261Updated last week
- Clara – Privacy-first, client-side AI assistant for Ollama with tool calling & mini n8n-style flow builder. No backend. No data leaks. 10…☆628Updated this week
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible☆310Updated last month
- Slightly improved official version for finetune xtts☆336Updated 2 weeks ago
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper.☆74Updated 2 months ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆1,259Updated 3 weeks ago
- An AI-powered interactive avatar engine using Live2D, LLM, ASR, TTS, and RVC. Ideal for VTubing, streaming, and virtual assistant applica…☆94Updated this week
- An OAI compatible exllamav2 API that's both lightweight and fast☆915Updated this week
- A local AI companion that uses a collection of free, open source AI models in order to create two virtual companions that will follow you…☆199Updated last week