gitmylo/audio-webui

A webui for different audio related Neural Networks

☆1,243

Alternatives and similar repositories for audio-webui

Users who are interested in audio-webui are comparing it to the repositories listed below.

gitmylo / bark-voice-cloning-HuBERT-quantizer
The code for the bark-voicecloning model. Training and inference.
☆711 · Sep 13, 2023 · Updated 2 years ago
rsxdalv / TTS-WebUI
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro,…
☆3,211 · Jul 6, 2026 · Updated 2 weeks ago
C0untFloyd / bark-gui
🔊 Text-Prompted Generative Audio Model with Gradio
☆689 · Nov 23, 2023 · Updated 2 years ago
neonbjb / tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
☆14,866 · Nov 19, 2024 · Updated last year
serp-ai / bark-with-voice-clone
🔊 Text-prompted Generative Audio Model - With the ability to clone voices
☆3,339 · Aug 24, 2025 · Updated 11 months ago
JonathanFly / bark
🚀 BARK INFINITY GUI CMD 🎶 Powered Up Bark Text-prompted Generative Audio Model
☆1,006 · Oct 21, 2023 · Updated 2 years ago
yl4579 / StyleTTS2
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
☆6,314 · Aug 10, 2024 · Updated last year
daswer123 / xtts-webui
Webui for using XTTS and for finetuning it
☆891 · Jan 17, 2025 · Updated last year
oobabooga / textgen
Open-source desktop app for local LLMs. Text, vision, tool-calling, OpenAI/Anthropic-compatible API. 100% private.
☆47,484 · Jun 2, 2026 · Updated last month
suno-ai / bark
🔊 Text-Prompted Generative Audio Model
☆39,213 · Aug 19, 2024 · Updated last year
erew123 / alltalk_tts
AllTalk is based on the Coqui TTS engine, similar to the Coqui_tts extension for Text generation webUI, however supports a variety of adv…
☆2,417 · Jan 9, 2026 · Updated 6 months ago
SayanoAI / RVC-Studio
The best looking and most functional webui for RVC related tasks. See website for UI demo:
☆223 · Apr 27, 2024 · Updated 2 years ago
GrandaddyShmax / audiocraft_plus
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆640 · Aug 15, 2024 · Updated last year
facebookresearch / audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor…
☆23,518 · Mar 3, 2026 · Updated 4 months ago
atxcowboy / megasearch
A plugin for Oobabooga TextUI that allows you to search multiple search engines. Initially we're using Google API or DuckDuckGo.
☆18 · Jun 4, 2023 · Updated 3 years ago
coqui-ai / TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆45,796 · Aug 16, 2024 · Updated last year
IAHispano / Applio
A simple, high-quality voice conversion tool focused on ease of use and performance.
☆3,509 · Updated this week
vladmandic / sdnext
SD.Next: All-in-one WebUI for AI generative image and video creation, captioning and processing
☆7,182 · Updated this week
voice-cloning-app / Voice-Cloning-App
A Python/Pytorch app for easily synthesising human voices
☆1,441 · Dec 2, 2024 · Updated last year
haoheliu / AudioLDM2
Text-to-Audio/Music Generation
☆2,636 · Sep 29, 2024 · Updated last year
Nerogar / OneTrainer
OneTrainer is a one-stop solution for all your Diffusion training needs.
☆3,126 · Updated this week
RVC-Project / Retrieval-based-Voice-Conversion-WebUI
Easily train a good VC model with voice data <= 10 mins!
☆36,618 · Updated this week
Mangio621 / Mangio-RVC-Fork
*CREPE+HYBRID TRAINING* A very experimental fork of the Retrieval-based-Voice-Conversion-WebUI repo that incorporates a variety of other …
☆1,227 · Sep 27, 2023 · Updated 2 years ago
wsippel / bark_tts
Oobabooga extension for Bark TTS
☆117 · Nov 23, 2023 · Updated 2 years ago
152334H / DL-Art-School
TorToiSe fine-tuning with DLAS
☆224 · Aug 1, 2024 · Updated last year
numz / sd-wav2lip-uhq
Wav2Lip UHQ extension for Automatic1111
☆1,420 · Jun 14, 2024 · Updated 2 years ago
haoheliu / AudioLDM
AudioLDM: Generate speech, sound effects, music and beyond, with text.
☆2,908 · Jun 25, 2025 · Updated last year
jasonppy / VoiceCraft
Zero-Shot Speech Editing and Text-to-Speech in the Wild
☆8,503 · May 30, 2026 · Updated last month
152334H / tortoise-tts-fast
Fast TorToiSe inference (5x or your money back!)
☆826 · Jul 10, 2024 · Updated 2 years ago
YanniKouloumbis / next-js-window-ai
A Next.js chatbot app demonstrating seamless integration with window.ai.
☆15 · Jun 25, 2023 · Updated 3 years ago
1aienthusiast / audiocraft-infinity-webui
☆172 · Aug 14, 2023 · Updated 2 years ago
Stability-AI / stable-audio-tools
Generative models for conditional audio generation
☆3,826 · Updated this week
painebenjamin / app.enfugue.ai
ENFUGUE is an open-source web app for making studio-grade images and video using generative AI.
☆759 · Oct 29, 2024 · Updated last year
dunky11 / voicesmith
[WIP] VoiceSmith makes training text to speech models easy.
☆231 · Oct 10, 2022 · Updated 3 years ago
FartyPants / Playground
Text WebUI extension to add clever Notebooks to Chat mode
☆148 · Aug 7, 2025 · Updated 11 months ago
SociallyIneptWeeb / AICoverGen
A WebUI to create song covers with any RVC v2 trained AI voice from YouTube videos or audio files.
☆1,432 · Feb 15, 2025 · Updated last year
invoke-ai / InvokeAI
Invoke is a leading creative engine for Stable Diffusion models, empowering professionals, artists, and enthusiasts to generate and creat…
☆27,651 · Updated this week
JarodMica / ai-voice-cloning
☆780 · Jun 9, 2025 · Updated last year
rsxdalv / one-click-installers-tts
Simplified installers for suno-ai/bark, musicgen, tortoise, RVC, demucs and vocos
☆47 · Jul 6, 2024 · Updated 2 years ago