DanRuta / xva-trainer
UI app for training TTS/VC machine learning models for xVASynth, with several audio pre-processing tools, and dataset creation/management.
☆101 · Updated last year
Alternatives and similar repositories for xva-trainer
Users who are interested in xva-trainer are comparing it to the repositories listed below.
- Machine learning based speech synthesis Electron app, with voices from specific characters from video games ☆627 · Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos ☆46 · Updated 11 months ago
- Desktop application for neural speech synthesis written in C++ ☆215 · Updated 2 years ago
- A Gradio setup for Tortoise TTS. ☆45 · Updated 2 years ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) ☆70 · Updated 2 years ago
- GradioUI for TortoiseTTS voice generation ☆34 · Updated last year
- ☆69 · Updated 8 months ago
- XTTSv2 Extension for oobabooga text-generation-webui ☆154 · Updated last year
- Full GUI Version ☆31 · Updated 2 years ago
- Oobabooga extension for Bark TTS ☆119 · Updated last year
- A web app that lets you play around with TalkNet models ☆121 · Updated last year
- Audio datasets, easier. ☆84 · Updated last year
- Diffusion_TTS extension for booga ☆67 · Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆68 · Updated 9 months ago
- ☆67 · Updated 3 months ago
- ☆100 · Updated 10 months ago
- Made slight modifications to the Tortoise API, provided 3 additional scripts to make using Tortoise easier. Less focus on cloning makes s… ☆52 · Updated last year
- A KoboldAI-like memory extension for oobabooga's text-generation-webui ☆108 · Updated 7 months ago
- A TTS extension for oobabooga text WebUI ☆32 · Updated last year
- AI powered speech denoising and enhancement. Adapted for windows and optimized ☆89 · Updated 11 months ago
- Slightly improved official version for finetune xtts ☆73 · Updated 9 months ago
- Hard Reload oobabooga text WebUI extensions ☆18 · Updated 5 months ago
- Easily run text-to-video diffusion with customized video length, fps, and dimensions on 4GB video cards or on CPU. ☆109 · Updated 9 months ago
- Stable Diffusion UI: Diffusers (CUDA/ONNX) ☆130 · Updated last year
- Mantella is a Skyrim and Fallout 4 mod which allows you to naturally speak to NPCs using Whisper (speech-to-text), LLMs (text generation)… ☆277 · Updated 2 weeks ago
- A simple extension that uses Bark Text-to-Speech for audio output ☆34 · Updated last year
- ☆72 · Updated last year
- An extension to use Kokoro TTS in text generation webui ☆20 · Updated last month
- ☆83 · Updated 11 months ago
- A UI for the Piper TTS ☆93 · Updated 9 months ago