jpgallegoar / Spanish-F5
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
☆138Updated 2 months ago
Alternatives and similar repositories for Spanish-F5
Users who are interested in Spanish-F5 are comparing it to the libraries listed below.
- ☆224Updated 2 years ago
- Presidential bot built on top of a Llama3-8B fine-tune on 100+ hours of video interviews☆53Updated last year
- ☆784Updated 4 months ago
- Facturas2json is a program that lets you extract structured data from invoices using the Marker and nuExtract models.☆38Updated last year
- ☆467Updated 11 months ago
- Building an assistant for Boletin Oficial del Estado (BOE) using Retrieval Augmented Generation (RAG)☆135Updated last year
- Microsoft surprises us with Florence-2, an AI that does EVERYTHING in computer vision. It detects objects in images and videos, segments with …☆14Updated last year
- Webui for using XTTS and for finetuning it☆843Updated 8 months ago
- ☆492Updated 7 months ago
- ☆17Updated last year
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech☆361Updated 10 months ago
- ☆34Updated last year
- ☆169Updated last year
- YuE: Open Full-song Generation Foundation for the GPU Poor☆441Updated 7 months ago
- ☆488Updated 4 months ago
- Slightly improved version of the official code for fine-tuning XTTS☆371Updated 6 months ago
- ☆147Updated 11 months ago
- A Collection of Google Colab Notebooks for various projects☆295Updated this week
- just unzip and use it with gradio☆69Updated 8 months ago
- Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation☆207Updated last year
- ☆66Updated 6 months ago
- A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web i…☆224Updated last month
- A Gradio UI for XTTSv2 and RVC.☆66Updated last year
- High-performance Text-to-Speech server with OpenAI-compatible API, 8 voices, emotion tags, and modern web UI. Optimized for RTX GPUs.☆563Updated 3 months ago
- A simple, high-quality voice conversion tool focused on ease of use and performance.☆2,629Updated last week
- [CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis☆1,881Updated 2 weeks ago
- TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching☆786Updated 2 months ago
- A Gradio UI for XTTSv2 and RVC.☆158Updated last year
- A lightweight, self-hosted headless browser automation platform. Designed as an alternative to Browserless, built for speed, privacy, and…☆573Updated 2 weeks ago
- A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats includ…☆800Updated last month