Haurrus / xtts-trainer-no-ui-autoLinks
This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for accelerated training.
☆13Updated 8 months ago
Alternatives and similar repositories for xtts-trainer-no-ui-auto
Users that are interested in xtts-trainer-no-ui-auto are comparing it to the libraries listed below
Sorting:
- ☆27Updated last year
- Turn any common eBook file into an HQ Audiobook with F5-TTS (Easy Install)☆25Updated 3 weeks ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆26Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆62Updated 7 months ago
- A lightweight, efficient variation of the StyleTTS 2 text‐to‐speech model.☆21Updated last month
- SadTalker gradio_demo.py file with code section that allows you to set the eye blink and pose reference videos for the software to use wh…☆11Updated 2 years ago
- AudioLDM text to audio colab☆19Updated last year
- ☆39Updated last year
- 🎵 LyricWave – AI Music Composer (Proof of Concept) 🎶 A personal project exploring automatic generation of unique MP4 songs. LyricWave b…☆34Updated 3 months ago
- Auto-Video maker handling many AI's☆11Updated last year
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 7 months ago
- Site for sharing MusicGen + AudioGen Prompts and Creations☆45Updated 2 months ago
- Visual Clip Picker: Trimming Clips by Face Recognition☆42Updated 2 years ago
- Misc. tools/scripts that I made to use for tortoise☆21Updated 10 months ago
- ☆19Updated 9 months ago
- Advanced RVC Inference for quicker and effortless model downloads☆52Updated 2 months ago
- Performs the entire AI cover generation process with UI☆18Updated last month
- ☆40Updated last year
- Ai generated music video with Riffusion and Gradio☆21Updated 2 years ago
- 1 min voice data can also be used to train a good TTS model! (few shot voice cloning)☆26Updated last week
- GUI to sync video mouth movements to match audio, utilizing wav2lip-hq. Completed as part of a technical interview.☆11Updated last year
- VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.☆61Updated 8 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆70Updated 2 years ago
- ☆43Updated last year
- Jupyter notebooks for Inpainting | Outpainting with Flux.1 Fill dev. Able to run on Google Colab Free Tier☆31Updated 6 months ago
- 🎙️ Automatically transcribe audio/video into high-quality, speaker-specific Text-To-Speech datasets ✨☆84Updated last month
- ☆14Updated 11 months ago
- Whisper Speaker Identification (WSI), a cutting-edge model for multilingual speaker identification.☆18Updated 3 months ago
- This project provides a Flask-based API for generating high-quality text-to-speech (TTS) audio using F5-TTS, a flexible and powerful TTS …☆12Updated 3 months ago
- One-shot face animation using webcam, capable of running in real time.☆37Updated last year