Haurrus / xtts-trainer-no-ui-autoLinks
This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for accelerated training.
☆13Updated last year
Alternatives and similar repositories for xtts-trainer-no-ui-auto
Users that are interested in xtts-trainer-no-ui-auto are comparing it to the libraries listed below
Sorting:
- ☆27Updated 2 years ago
- ☆40Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆48Updated 9 months ago
- Auto-Video maker handling many AI's☆11Updated last year
- Advanced RVC Inference for quicker and effortless model downloads☆64Updated this week
- ☆14Updated last year
- ☆40Updated 2 years ago
- ☆16Updated 2 years ago
- ☆119Updated last week
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 months ago
- ☆75Updated last year
- ☆23Updated last year
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆33Updated last year
- ☆83Updated last year
- Code for the paper "Free-View Expressive Talking Head Video Editing" (ICASSP 2023)☆11Updated last year
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆73Updated 6 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆66Updated last year
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆33Updated last year
- ☆17Updated 11 months ago
- 🎵 LyricWave – AI Music Composer (Proof of Concept) 🎶 A personal project exploring automatic generation of unique MP4 songs. LyricWave b…☆39Updated 3 months ago
- ☆22Updated last year
- ☆19Updated last year
- ☆31Updated 2 years ago
- GUI for a Vocal Remover that uses Deep Neural Networks.☆17Updated last year
- ☆24Updated 2 years ago
- AudioLDM text to audio colab☆19Updated 2 years ago
- Performs the entire AI cover generation process with UI☆28Updated 4 months ago
- Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GP…☆40Updated 9 months ago
- OminiControl for the GPU Poor☆39Updated 11 months ago
- ☆20Updated last year