DanRuta / xva-trainerLinks
UI app for training TTS/VC machine learning models for xVASynth, with several audio pre-processing tools, and dataset creation/management.
☆99Updated last year
Alternatives and similar repositories for xva-trainer
Users that are interested in xva-trainer are comparing it to the libraries listed below
Sorting:
- Machine learning based speech synthesis Electron app, with voices from specific characters from video games☆626Updated last year
- Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos☆46Updated last year
- GradioUI for TortoiseTTS voice generation☆34Updated last year
- A web app that lets you play around with TalkNet models☆121Updated last year
- Oobabooga extension for Bark TTS☆119Updated last year
- AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)☆71Updated 2 years ago
- ☆70Updated 8 months ago
- A Gradio setup for Tortoise TTS.☆45Updated 2 years ago
- XTTSv2 Extension for oobabooga text-generation-webui☆155Updated last year
- Full GUI Version☆31Updated 2 years ago
- ☆49Updated this week
- Audio datasets, easier.☆84Updated last year
- ☆101Updated 11 months ago
- 🔊 Text2Speech, Voice-Cloning and Voice2Voice conversion with the text-prompted generative audio model bark☆66Updated 2 weeks ago
- TTS pipeline that uses RVC to enhance audio quality and cloning☆145Updated last year
- The best looking and most functional webui for RVC related tasks. See website for UI demo:☆209Updated last year
- ☆73Updated last year
- Diffusion_TTS extension for booga☆67Updated last year
- Stable Diffusion UI: Diffusers (CUDA/ONNX)☆130Updated last year
- This is a modified version of NVIDIA's TalkNet. It is a controllable network that can be used for both CPU and GPU inference.☆45Updated last year
- 🔊 Text-prompted Generative Audio Model☆235Updated 2 years ago
- ☆75Updated 2 years ago
- Mantella is a Skyrim and Fallout 4 mod which allows you to naturally speak to NPCs using Whisper (speech-to-text), LLMs (text generation)…☆282Updated 2 weeks ago
- A UI for the Piper TTS☆93Updated 10 months ago
- Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)☆278Updated last week
- A Gradio UI for XTTSv2 and RVC.☆159Updated last year
- A TTS extension for oobabooga text WebUI☆32Updated last year
- AI powered speech denoising and enhancement. Adapted for windows and optimized☆89Updated last year
- Listen to any audio stream on your machine and print out the transcribed or translated audio.☆120Updated last year
- ☆17Updated 3 months ago