pinokiofactory / e2-f5-ttsLinks
☆65Updated 3 months ago
Alternatives and similar repositories for e2-f5-tts
Users that are interested in e2-f5-tts are comparing it to the libraries listed below
Sorting:
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆100Updated this week
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆81Updated 9 months ago
- ☆67Updated 4 months ago
- An AI focused photo manipulation tool based on Gradio☆185Updated last month
- Examples of using the llasa-tts models locally☆177Updated 3 months ago
- ☆51Updated 8 months ago
- ACE-Step: A Step Towards Music Generation Foundation Model☆42Updated 2 months ago
- Industry leading face manipulation platform☆331Updated 3 weeks ago
- ☆91Updated 2 months ago
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0☆53Updated last year
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI☆69Updated last year
- Use the Moondream 2 model to detect faces and their gaze directions in videos.☆44Updated 6 months ago
- Gradio UI for YuE☆68Updated 3 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA]☆101Updated 4 months ago
- OpenClap is a file format for the age of AI content production☆119Updated last year
- ☆148Updated last year
- Simple UI for Llama-3.2-11B-Vision & Molmo-7B-D☆136Updated 10 months ago
- Explore, Install, Innovate — in 1 Click.☆32Updated this week
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆66Updated last month
- SoTA open-source TTS☆48Updated last month
- API server for Instant voice cloning by MyShell.☆98Updated 10 months ago
- Sesame Converse - Real Time Conversations - Powered by Gemma 3☆63Updated 4 months ago
- Diffusion_TTS extension for booga☆66Updated last year
- Win & Liunux Gradio WebUI for CSM-1B model by sesame☆51Updated 4 months ago
- SoTA open-source TTS for Audiobook and Podcast Generation☆137Updated last month
- Upscale your videos up to 4k on free google colab using Real-ESRGAN☆185Updated 3 months ago
- ☆185Updated 4 months ago
- ☆21Updated 3 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆356Updated 7 months ago
- Run Replicate models as nodes in ComfyUI☆187Updated 8 months ago