Haurrus / xtts-trainer-no-ui-auto
This project includes a Python script for fine-tuning a text-to-speech (TTS) model. The script utilizes custom datasets and use CUDA for accelerated training.
☆13Updated last month
Related projects ⓘ
Alternatives and complementary repositories for xtts-trainer-no-ui-auto
- Advanced RVC Inference for quicker and effortless model downloads☆32Updated 8 months ago
- AudioLDM text to audio colab☆19Updated last year
- Site for sharing MusicGen + AudioGen Prompts and Creations☆39Updated 4 months ago
- ☆26Updated 11 months ago
- Auto-Video maker handling many AI's☆12Updated 8 months ago
- ☆77Updated 4 months ago
- Misc. tools/scripts that I made to use for tortoise☆18Updated 3 months ago
- ☆34Updated 6 months ago
- Multivoice: Enhance your foreign-language movie and TV show experience with personalized dubbed versions. Our project uses voice cloning …☆24Updated last year
- Text prompt steered synthetic audio generators☆45Updated 11 months ago
- ☆40Updated last year
- Create training data for training a voice cloner for bark text to speech.☆44Updated last year
- Using RVC via console or python scripts☆78Updated last month
- ☆14Updated 4 months ago
- The best gradio web-ui for creating cover song that uses mdx-net and rvc. Easy one click installation. Fully portable.☆14Updated last week
- Versatile AI-driven audio upscaler to enhance the quality of any audio.☆60Updated 2 months ago
- High-performance ASR tool using Faster Whisper, supporting custom models, multi-language transcription, and real-time processing feedback…☆10Updated 3 weeks ago
- Uses deepgram/whisper/custom models to create an LJSpeech dataset for voice model fine tuning☆12Updated 2 weeks ago
- ☆15Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆28Updated this week
- GUI for a Vocal Remover that uses Deep Neural Networks.☆13Updated 10 months ago
- Orchestrating AI for stunning lip-synced videos. Effortless workflow, exceptional results, all in one place.☆64Updated 4 months ago
- Auto-Lyrics: Lyrics transcription & alignment using Whisper and yt-dlp☆17Updated last month
- Towards Robust Blind Face Restoration with Codebook Lookup Transformer☆27Updated 10 months ago
- Text-to-Music Generation with Rectified Flow Transformer☆48Updated 2 months ago
- ☆18Updated 11 months ago
- This repository contains the code and data for the paper EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control by Haozhe Chen,…☆43Updated last month
- Official code of the paper: Draw an Audio: Leveraging Multi-Instruction for Video-to-Audio Synthesis.☆40Updated 2 months ago
- ☆54Updated 10 months ago
- (Windows/Linux/MacOS) Local WebUI with neural network models (Text, Image, Video, 3D, Audio) on python (Gradio interface). Translated on …☆73Updated last week