lxe / tts-server
A simple TTS server for generating speech using StyleTTS2
☆38 Updated last year
Alternatives and similar repositories for tts-server:
Users who are interested in tts-server are comparing it to the libraries listed below.
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G… ☆23 Updated 3 weeks ago
- Experience the power of AI with this free AI voice generator demo. Utilizing Deepgram and Groq, we transform text into voice seamlessly. … ☆37 Updated 10 months ago
- ☆48 Updated 5 months ago
- ☆41 Updated 2 months ago
- Pybind11 bindings for Whisper.cpp ☆55 Updated 3 weeks ago
- Example agents I've built using the LiveKit Agents (https://github.com/livekit/agents) framework ☆19 Updated 11 months ago
- Mobile web app for audio "push-to-talk" + TTS chat interface with OpenAI-like APIs ☆43 Updated last year
- Dagger functions to import Hugging Face GGUF models into a local ollama instance and optionally push them to ollama.com. ☆115 Updated 11 months ago
- A basic HTTP API for handling Faster Whisper audio transcriptions over the network ☆28 Updated 5 months ago
- Attend - to what matters. ☆14 Updated 2 months ago
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes. ☆38 Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s… ☆52 Updated last year
- This repo provides a simple Gradio UI to run Qwen2 VL 72B AWQ in venv and have both image and video inferencing work. ☆30 Updated 6 months ago
- An API for VoiceCraft. ☆25 Updated 9 months ago
- Diffusion_TTS extension for booga ☆67 Updated 10 months ago
- Simulates talk with an AI that can express emotions ☆65 Updated 9 months ago
- Deploy your GGML models to HuggingFace Spaces with Docker and gradio ☆36 Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning ☆158 Updated 9 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co… ☆53 Updated 6 months ago
- Open source tool for transcription and subtitling, alternative to happyscribe. ☆26 Updated 2 months ago
- Create text chunks which end at natural stopping points without using a tokenizer ☆26 Updated last month
- Something similar to Apple Intelligence? ☆60 Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆77 Updated 6 months ago
- Run Ollama LLM models in Google Colab for free ☆33 Updated 5 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) ☆66 Updated last year
- ☆24 Updated 3 months ago
- XTTSv2 Extension for oobabooga text-generation-webui ☆152 Updated last year
- XTTSv2 Extension for oobabooga text-generation-webui ☆34 Updated 9 months ago
- Ollama models of NousResearch/Hermes-2-Pro-Mistral-7B-GGUF ☆32 Updated last year
- ☆83 Updated 9 months ago