matatonic / openedai-speech
An OpenAI API compatible text to speech server using Coqui AI's xtts_v2 and/or piper tts as the backend.
☆742 Updated 2 months ago
Alternatives and similar repositories for openedai-speech:
Users who are interested in openedai-speech are comparing it to the libraries listed below.
- ☆1,675 Updated this week
- Local AI talk with a custom voice based on Zephyr 7B model. Uses RealtimeSTT with faster_whisper for transcription and RealtimeTTS with C… ☆612 Updated 8 months ago
- AlwaysReddy is a LLM voice assistant that is always just a hotkey away. ☆732 Updated last month
- A simple FastAPI Server to run XTTSv2 ☆495 Updated 8 months ago
- OpenAI compatible TTS for Sesame CSM:1b - Voice Cloning from File/YT ☆275 Updated 3 weeks ago
- Interface for OuteTTS models. ☆1,111 Updated this week
- Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching ☆2,347 Updated last week
- AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv… ☆1,681 Updated this week
- An OpenAI API compatible speech to text server for audio transcription and translations, aka. Whisper. ☆73 Updated 2 months ago
- Webui for using XTTS and for finetuning it ☆776 Updated 2 months ago
- a self-hosted webui for 30+ generative ai ☆570 Updated this week
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs. ☆232 Updated this week
- Model swapping for llama.cpp (or any local OpenAPI compatible server) ☆506 Updated last week
- A Fast TTS Engine ☆483 Updated 2 months ago
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M. ☆215 Updated 2 months ago
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ… ☆361 Updated last week
- https://hf.co/hexgrad/Kokoro-82M ☆2,268 Updated this week
- What If Language Models Expertly Routed All Inference? WilmerAI allows prompts to be routed to specialized workflows based on the domain … ☆640 Updated last week
- Local SRT/LLM/TTS Voicechat ☆658 Updated 6 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech ☆336 Updated 4 months ago
- Free, high-quality text-to-speech API endpoint to replace OpenAI, Azure, or ElevenLabs ☆684 Updated last week
- Plug Whisper audio transcription into a local Ollama server and output TTS audio responses ☆312 Updated last year
- Slightly improved official version for finetune xtts ☆335 Updated last week
- Run Orpheus 3B Locally With LM Studio ☆362 Updated 3 weeks ago
- Your Trusty Memory-enabled AI Companion - Simple RAG chatbot optimized for local LLMs | 12 Languages Supported | OpenAI API Compatible ☆309 Updated last month
- A Conversational Speech Generation Model with Gradio UI and OpenAI compatible API. UI and API support CUDA, MLX and CPU devices. ☆165 Updated 3 weeks ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production ☆1,228 Updated 3 weeks ago
- Command Your World with Voice ☆642 Updated 4 months ago
- A talking LLM that runs on your own computer without needing the internet. ☆432 Updated 7 months ago
- An OAI compatible exllamav2 API that's both lightweight and fast ☆901 Updated 3 weeks ago