playht / PlayDiffusion
☆500 Updated 3 weeks ago
Alternatives and similar repositories for PlayDiffusion
Users who are interested in PlayDiffusion are comparing it to the repositories listed below.
- Kyutai with an "eye" ☆207 Updated 3 months ago
- ☆439 Updated last month
- VoiceStar: Robust, Duration-controllable TTS that can Extrapolate ☆261 Updated last month
- ☆620 Updated 3 weeks ago
- A Fast TTS Engine ☆525 Updated 5 months ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching ☆756 Updated last month
- ☆605 Updated 2 weeks ago
- ☆426 Updated 2 months ago
- Fast Streaming TTS with Orpheus + WebRTC (with FastRTC) ☆298 Updated 3 months ago
- Open source inference code for Rev's model ☆411 Updated 2 months ago
- The official repo for paper "Spatial Speech Translation: Translating Across Space With Binaural Hearables" ☆64 Updated 2 months ago
- High-quality Text-to-Audio Generation with Efficient Diffusion Transformer ☆301 Updated 2 weeks ago
- LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis ☆586 Updated 3 months ago
- Make text LLMs listen and speak ☆512 Updated last week
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆97 Updated 3 months ago
- Google's NotebookLM, but local ☆315 Updated 2 months ago
- Examples of using the llasa-tts models locally ☆175 Updated 2 months ago
- A TTS model capable of generating ultra-realistic dialogue in one pass. ☆189 Updated 2 months ago
- Streaming and Fine-tuning for Chatterbox TTS ☆128 Updated last month
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM ☆265 Updated 2 months ago
- Sesame CSM 1B Voice Cloning ☆312 Updated 4 months ago
- Implementation of F5-TTS in MLX ☆561 Updated 3 months ago
- Interface for OuteTTS models. ☆1,332 Updated 3 weeks ago
- Run Orpheus 3B Locally With LM Studio ☆432 Updated 3 months ago
- VoiceRestore: Flow-Matching Transformers for Universal Speech Restoration ☆175 Updated 2 months ago
- G2P ☆272 Updated 2 months ago
- Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking Portrait ☆269 Updated last month
- A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M. ☆232 Updated 5 months ago
- ☆546 Updated last week
- Have a natural voice conversation with an LLM ☆250 Updated 7 months ago