rumourscape / F5-TTS
Fork of "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆10Updated 5 months ago
Alternatives and similar repositories for F5-TTS:
Users that are interested in F5-TTS are comparing it to the libraries listed below
- This is an on-CPU real-time conversational system for two-way speech communication with AI models, utilizing a continuous streaming archi…☆117Updated 3 weeks ago
- ☆43Updated 2 weeks ago
- Adds a web API to RVC to infer via json requests☆23Updated 9 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆57Updated 5 months ago
- Implementation of Sesame's Conversational Speech Model for Hugging Face Transformers☆53Updated 3 weeks ago
- Examples of using the llasa-tts models locally☆168Updated 2 weeks ago
- ☆64Updated last month
- ☆108Updated last month
- 🎵 LyricWave - Your AI Music Composer 🎶 Compose Unique MP4 Songs Effortlessly! LyricWave uses AI to create personalized music by harmoni…☆31Updated last month
- ☆269Updated 11 months ago
- This project presents a comprehensive study on video dubbing techniques and the development of a specialized video dubbing system.☆11Updated last year
- ☆30Updated 4 months ago
- List of curated use cases built using Sesame's CSM 1B☆65Updated last month
- ☆96Updated last year
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.☆44Updated last year
- Gradio UI for YuE☆45Updated last month
- ☆254Updated this week
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆96Updated 2 weeks ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆78Updated 6 months ago
- LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM☆242Updated last month
- ☆223Updated last month
- A quality zero-shot lipsync pipeline built with MuseTalk, LivePortrait, and CodeFormer.☆37Updated 7 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆70Updated 10 months ago
- ☆36Updated last year
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆167Updated last month
- 🐍 🤖 Pip installable package for StyleTTS 2 human-level text-to-speech and voice cloning☆159Updated 9 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆324Updated 2 weeks ago
- Text to Speech using Coqui TTS + RVC☆101Updated last year
- Codec for paper: LLaSA: Scaling Train-time and Inference-time Compute for LLaMA-based Speech Synthesis☆264Updated last month
- Running the F5-TTS by ONNX Runtime☆148Updated last week