pinokiofactory / e2-f5-tts
☆74 Updated last week
Alternatives and similar repositories for e2-f5-tts
Users who are interested in e2-f5-tts are comparing it to the repositories listed below.
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on … ☆106 Updated 2 weeks ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆82 Updated last year
- ☆44 Updated 10 months ago
- Examples of using the llasa-tts models locally ☆182 Updated 8 months ago
- Lightweight Gradio based WebUI for orpheusTTS - WSL / Linux [CUDA] ☆105 Updated last month
- Some music tools in ComfyUI ☆107 Updated 3 weeks ago
- ☆73 Updated 9 months ago
- Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant. ☆77 Updated last week
- Industry leading face manipulation platform ☆362 Updated 2 months ago
- An AI focused photo manipulation tool based on Gradio ☆183 Updated 6 months ago
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i… ☆248 Updated last week
- One-click launcher for Audiocraft MusicGen + AudioGen Gradio Web UI ☆71 Updated 2 years ago
- deep hermes, but decides how to respond based on its OWN decision, no need for system prompts. ☆38 Updated 9 months ago
- ⚡ AI Avatar Factory is an interface for creating and managing AI avatars. ⚡ ☆63 Updated 4 months ago
- Automated speech dataset creator ☆213 Updated 6 months ago
- StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models ☆37 Updated 7 months ago
- SoTA open-source TTS for Audiobook and Podcast Generation ☆177 Updated 6 months ago
- OminiControl for the GPU Poor ☆39 Updated 11 months ago
- MFLUX-WEBUI using MLX and the FLUX DEV and Schnell models ☆109 Updated last month
- Upscale your videos up to 4k on free google colab using Real-ESRGAN ☆200 Updated 8 months ago
- ☆51 Updated last year
- ☆185 Updated 9 months ago
- A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files. ☆46 Updated 2 years ago
- ☆148 Updated last year
- Custom ComfyUI nodes for our community ☆129 Updated last week
- Gradio UI for YuE ☆85 Updated 8 months ago
- A high quality and fast TTS repository ☆358 Updated last week
- Industry leading face manipulation platform ☆142 Updated 2 months ago
- ☆20 Updated 8 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark ☆69 Updated 6 months ago