fedirz / parler-tts-serverLinks
☆26Updated last year
Alternatives and similar repositories for parler-tts-server
Users that are interested in parler-tts-server are comparing it to the libraries listed below
Sorting:
- ☆75Updated last year
- Site for sharing Bark voices☆51Updated 5 months ago
- Examples of using the llasa-tts models locally☆180Updated 4 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆101Updated 5 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆71Updated 2 years ago
- API server for Instant voice cloning by MyShell.☆102Updated 11 months ago
- A curated list of amazing RunPod projects, libraries, and resources☆121Updated last year
- ☆67Updated 5 months ago
- ☆99Updated last year
- Diffusion_TTS extension for booga☆66Updated last year
- ☆57Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆80Updated 10 months ago
- OminiControl for the GPU Poor☆37Updated 7 months ago
- ☆26Updated last year
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆101Updated last week
- ☆91Updated 3 months ago
- XTTSv2 Extension for oobabooga text-generation-webui☆155Updated last year
- ☆45Updated 7 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- An updated specification for AI character cards.☆129Updated 2 years ago
- Oobabooga extension for Bark TTS☆118Updated last year
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models☆37Updated 3 months ago
- Streaming and Fine-tuning for Chatterbox TTS☆171Updated 2 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆54Updated last year
- Gradio UI for YuE☆71Updated 5 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆57Updated 10 months ago
- LLaSA WebUI using ExLlamaV2 and FastAPI.☆27Updated 5 months ago
- A simple framework for using a local Koboldcpp LLM to help with story-writing☆24Updated last year
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated last year
- Accepts a Hugging Face model URL, automatically downloads and quantizes it using Bits and Bytes.☆38Updated last year