playht / PlayDiffusionLinks
☆224Updated this week
Alternatives and similar repositories for PlayDiffusion
Users that are interested in PlayDiffusion are comparing it to the libraries listed below
Sorting:
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate☆231Updated this week
- Kyutai with an "eye"☆197Updated 2 months ago
- A Fast TTS Engine☆506Updated 4 months ago
- ☆404Updated 2 weeks ago
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait☆261Updated last week
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆38Updated 2 weeks ago
- A TTS model capable of generating ultra-realistic dialogue in one pass.☆164Updated last month
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆254Updated 2 weeks ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables"☆61Updated 3 weeks ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆96Updated 2 months ago
- A lightweight end-to-end text-to-speech model☆115Updated 3 months ago
- Self-host the powerful Dia TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), suppor…☆218Updated this week
- A random walk voice style cloning application for Kokoro text to speech☆85Updated last week
- List of curated use cases built using Sesame's CSM 1B☆66Updated last week
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆731Updated 3 months ago
- KeySync: A Robust Approach for Leakage-free Lip Synchronization in High Resolution☆308Updated 2 weeks ago
- ☆382Updated 3 weeks ago
- ☆54Updated 8 months ago
- ☆174Updated 2 weeks ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration☆170Updated last month
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.☆227Updated 4 months ago
- Open source inference code for Rev's model☆404Updated last month
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆568Updated last month
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆151Updated last month
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC)☆288Updated last month
- Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible),…☆38Updated this week
- Command-line personal assistant using your favorite proprietary or local models with access to over 30+ tools☆109Updated 2 months ago
- Examples of using the llasa-tts models locally☆171Updated last month
- Building Blocks for Multi-Modal Gradio Powered by Groq Apps☆111Updated 7 months ago
- Googles NotebookLM but local☆267Updated last month