DanRuta / xva-trainer
UI app for training TTS/VC machine learning models for xVASynth, with several audio pre-processing tools, and dataset creation/management.
☆100 Updated last year
Alternatives and similar repositories for xva-trainer
Users who are interested in xva-trainer are comparing it to the repositories listed below
- Machine learning based speech synthesis Electron app, with voices from specific characters from video games ☆624 Updated last year
- Oobabooga extension for Bark TTS ☆117 Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos ☆45 Updated 10 months ago
- Audio datasets, easier. ☆84 Updated last year
- Full GUI Version ☆31 Updated 2 years ago
- XTTSv2 Extension for oobabooga text-generation-webui ☆153 Updated last year
- A very simple implementation of edge_tts w/ RVC for oobabooga text-generation-webui. ☆41 Updated last year
- A Gradio setup for Tortoise TTS. ☆45 Updated 2 years ago
- A web app that lets you play around with TalkNet models ☆119 Updated last year
- TorToiSe fine-tuning with DLAS ☆220 Updated 9 months ago
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept) ☆68 Updated last year
- A Lightweight Gradio Web interface for Text-to-Audio Generation utilising SAO1.0 ☆51 Updated 11 months ago
- The best looking and most functional webui for RVC related tasks. See website for UI demo: ☆203 Updated last year
- GradioUI for TortoiseTTS voice generation ☆34 Updated last year
- RVC Inference with multiple model and huggingface support ☆105 Updated last year
- ☆68 Updated 6 months ago
- Diffusion_TTS extension for booga ☆67 Updated 10 months ago
- Mantella is a Skyrim and Fallout 4 mod which allows you to naturally speak to NPCs using Whisper (speech-to-text), LLMs (text generation)… ☆264 Updated this week
- A very basic bot for generating Stable Diffusion images via the text-generation-webui ☆73 Updated last year
- TTS pipeline that uses RVC to enhance audio quality and cloning ☆145 Updated last year
- C++ library for converting text to phonemes for Piper ☆118 Updated last year
- A Gradio UI for XTTSv2 and RVC. ☆157 Updated 11 months ago
- Quick webui for audiocraft ☆156 Updated 2 months ago
- AI powered speech denoising and enhancement. Adapted for Windows and optimized ☆85 Updated 10 months ago
- A TTS extension for oobabooga text WebUI ☆31 Updated last year
- ☆96 Updated last year
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference. ☆45 Updated last year
- A simple extension that uses Bark Text-to-Speech for audio output ☆35 Updated last year
- ☆66 Updated last month
- ☆83 Updated 10 months ago