DanRuta / xva-trainer
UI app for training TTS/VC machine learning models for xVASynth, with several audio pre-processing tools, and dataset creation/management.
☆99 Updated last year
Alternatives and similar repositories for xva-trainer:
Users who are interested in xva-trainer are comparing it to the libraries listed below.
- Machine-learning-based speech synthesis Electron app, with voices from specific characters from video games ☆618 Updated 11 months ago
- Full GUI Version ☆31 Updated last year
- A Gradio setup for Tortoise TTS. ☆45 Updated last year
- Audio datasets, easier. ☆83 Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos ☆45 Updated 8 months ago
- XTTSv2 Extension for oobabooga text-generation-webui ☆152 Updated last year
- ☆66 Updated 5 months ago
- A Gradio UI for XTTSv2 and RVC. ☆156 Updated 10 months ago
- Mantella is a Skyrim and Fallout 4 mod which allows you to naturally speak to NPCs using Whisper (speech-to-text), LLMs (text generation)… ☆248 Updated this week
- A web app that lets you play around with TalkNet models ☆118 Updated last year
- Gradio UI for TortoiseTTS voice generation ☆34 Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆68 Updated 6 months ago
- Oobabooga extension for Bark TTS ☆118 Updated last year
- Diffusion_TTS extension for booga ☆67 Updated 9 months ago
- Slightly improved official version for fine-tuning XTTS ☆71 Updated 6 months ago
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆145 Updated last year
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s… ☆52 Updated 11 months ago
- TorToiSe fine-tuning with DLAS ☆218 Updated 8 months ago
- For our homelab server ☆31 Updated 2 years ago
- A lightweight Gradio web interface for text-to-audio generation utilising SAO1.0 ☆50 Updated 9 months ago
- AI-powered speech denoising and enhancement. Adapted for Windows and optimized ☆82 Updated 8 months ago
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui. ☆41 Updated last year
- A high-throughput and memory-efficient inference and serving engine for LLMs (Windows build & kernels) ☆20 Updated last week
- 🔊 Text-prompted Generative Audio Model ☆237 Updated last year
- Stable Diffusion UI - 2.1 + Inpainting + ∞ - ONNX for CPU or AMD graphics card ☆30 Updated 2 years ago
- Fast and memory-efficient exact attention - Windows wheels ☆33 Updated last year
- Faster Tortoise inference than Tortoise Fast Fork ☆128 Updated 11 months ago
- Stable Diffusion UI: Diffusers (CUDA/ONNX) ☆131 Updated last year
- ☆59 Updated last week
- Easily run text-to-video diffusion with customized video length, fps, and dimensions on 4GB video cards or on CPU. ☆108 Updated 6 months ago