playht / PlayDiffusion
☆516 Updated last month
Alternatives and similar repositories for PlayDiffusion
Users who are interested in PlayDiffusion are comparing it to the libraries listed below.
- VibeVoice Community Fork: Long-form conversational TTS ☆418 Updated this week
- ☆246 Updated 3 weeks ago
- ☆458 Updated 4 months ago
- Kyutai with an "eye" ☆218 Updated 5 months ago
- ☆632 Updated last month
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆282 Updated 3 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching ☆784 Updated last month
- VLLM Port of the Chatterbox TTS model ☆299 Updated last week
- VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning ☆584 Updated this week
- ☆451 Updated 4 months ago
- Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support… ☆299 Updated this week
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆308 Updated 5 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆308 Updated 2 months ago
- Streaming and Fine-tuning for Chatterbox TTS ☆182 Updated 3 months ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆613 Updated 5 months ago
- Open Audio Watermarking Tool ☆276 Updated 2 months ago
- ☆280 Updated last month
- A Fast TTS Engine ☆543 Updated 7 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables" ☆67 Updated last month
- ☆842 Updated last week
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi… ☆192 Updated 5 months ago
- Open source inference code for Rev's model ☆428 Updated 4 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F… ☆237 Updated 5 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆206 Updated 4 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆278 Updated 4 months ago
- Unofficial WIP LoRA Finetuning repository for VibeVoice ☆116 Updated this week
- Examples of using the llasa-tts models locally ☆180 Updated 5 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆103 Updated 6 months ago
- G2P ☆316 Updated last month
- ☆288 Updated 2 months ago