jpgallegoar / Spanish-F5Links
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆142Updated last month
Alternatives and similar repositories for Spanish-F5
Users that are interested in Spanish-F5 are comparing it to the libraries listed below
Sorting:
- Presidential bot built on top of Llama3-8B fine-tune over +100 hours of video interviews☆53Updated last year
- ☆781Updated 6 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆377Updated last year
- Webui for using XTTS and for finetuning it☆859Updated 10 months ago
- A Gradio UI for XTTSv2 and RVC.☆66Updated last year
- ☆509Updated last week
- Slightly improved official version for finetune xtts☆378Updated 8 months ago
- Facturas2json es un programa que te permite extraer datos estructurados a partir de facturas utilizando los modelos Marker y nuExtract.☆40Updated last year
- A Gradio UI for XTTSv2 and RVC.☆159Updated last year
- Webui for using XTTS and for finetuning it☆115Updated last year
- ☆148Updated last year
- ☆224Updated 2 years ago
- YuE: Open Full-song Generation Foundation for the GPU Poor☆453Updated 10 months ago
- A simple FastAPI Server to run XTTSv2☆562Updated last year
- ☆34Updated last year
- Slightly improved official version for finetune xtts☆70Updated last year
- ☆169Updated last year
- ☆472Updated last year
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆245Updated 3 weeks ago
- A webui for different audio related Neural Networks☆1,217Updated 6 months ago
- ☆498Updated 9 months ago
- ☆17Updated last year
- ☆71Updated 8 months ago
- just unzip and use it with gradio☆75Updated 10 months ago
- Microsoft nos sorprende con Florence-2, una IA que hace de TODO en visión artificial. Detecta objetos en imágenes y videos, segmenta con …☆14Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆52Updated 11 months ago
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆614Updated 5 months ago
- VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)☆844Updated this week
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆208Updated last year
- 🟢 NVIDIA ONLY – All-in-One TTS App with Kokoro, KittenTTS, Higgs audio, Chatterbox, Fish-Speech, F5 & index-tts & indextts2, Supports Co…☆106Updated last week