PasiKoodaa / F5-TTSLinks
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆79Updated 7 months ago
Alternatives and similar repositories for F5-TTS
Users that are interested in F5-TTS are comparing it to the libraries listed below
Sorting:
- Extract voice segments of a target speaker from podcasts - Useful for creating speech datasets☆126Updated last week
- Orpheus Chat WebUI☆62Updated 2 months ago
- ☆67Updated 2 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆96Updated 2 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆62Updated 2 months ago
- ☆95Updated last year
- An AI focused photo manipulation tool based on Gradio☆182Updated last week
- Examples of using the llasa-tts models locally☆171Updated last month
- ☆50Updated 6 months ago
- Since the owner of the repo took it down and it used an MIT license, I guess it's okay to upload it here for people to use.☆44Updated 2 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s…☆52Updated last year
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆44Updated 2 months ago
- A random walk voice style cloning application for Kokoro text to speech☆85Updated last week
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆96Updated 2 weeks ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆40Updated last week
- ☆54Updated 6 months ago
- Deploy Apollo HF space locally☆40Updated 5 months ago
- After my server ui improvements were successfully merged, consider this repo a playground for experimenting, tinkering and hacking around…☆53Updated 9 months ago
- Run Orpheus 3B Locally with Gradio UI, Standalone App☆21Updated 2 months ago
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆40Updated 4 months ago
- Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. F…☆151Updated last month
- Realtime tts reading of large textfiles by your favourite voice. +Translation via LLM (Python script)☆52Updated 7 months ago
- Streaming for Chatterbox TTS☆48Updated this week
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆53Updated 11 months ago
- Adding a multi-text multi-speaker script (diffe) that is based on a script from asiff00 on issue 61 for Sesame: A Conversational Speech G…☆23Updated 2 months ago
- Gradio Demo for ComfyDeploy☆53Updated 9 months ago
- ☆63Updated last month
- ☆43Updated 4 months ago
- Diffusion_TTS extension for booga☆67Updated 11 months ago
- This extension enhances the capabilities of textgen-webui by integrating advanced vision models, allowing users to have contextualized co…☆54Updated 7 months ago