C0untFloyd / bark-gui
Text-Prompted Generative Audio Model with Gradio
☆690 · Updated last year
Alternatives and similar repositories for bark-gui
Users that are interested in bark-gui are comparing it to the libraries listed below
- The code for the bark-voicecloning model. Training and inference. ☆698 · Updated last year
- BARK INFINITY GUI CMD Powered Up Bark Text-prompted Generative Audio Model ☆1,008 · Updated last year
- A webui for different audio related Neural Networks ☆1,167 · Updated last week
- A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice,… ☆2,205 · Updated this week
- Text-prompted Generative Audio Model ☆234 · Updated 2 years ago
- Wav2Lip UHQ extension for Automatic1111 ☆1,375 · Updated 11 months ago
- Extended faceswap extension for StableDiffusion web-ui with multiple faceswaps, inpainting, checkpoints, .... ☆805 · Updated 9 months ago
- AUTOMATIC1111 UI extension for creating videos using img2img and ebsynth. ☆1,283 · Updated last year
- Fast TorToiSe inference (5x or your money back!) ☆814 · Updated 10 months ago
- Automatically detects faces and replaces them ☆337 · Updated last year
- This script automates video stylization tasks using StableDiffusion and ControlNet. ☆815 · Updated last year
- High quality Lip sync ☆1,116 · Updated 10 months ago
- An all in one solution for adding Temporal Stability to a Stable Diffusion Render via an automatic1111 extension ☆1,957 · Updated last year
- Webui for using XTTS and for finetuning it ☆808 · Updated 4 months ago
- Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor… ☆609 · Updated 9 months ago
- A User Interface for XTTS-2 Text-Based Voice Cloning using only 10 seconds of speech ☆344 · Updated 5 months ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion ☆669 · Updated 4 months ago
- Audio Slicer that uses silence detection to split .wav audio files into multiple .wav samples. ☆300 · Updated last year
- High-Fidelity Lip-Syncing with Wav2Lip and Real-ESRGAN ☆454 · Updated last year
- Text-prompted Generative Audio Model - With the ability to clone voices ☆3,295 · Updated 11 months ago
- A simple Stable Diffusion WebUI extension that adds a Photopea tab and integration. ☆816 · Updated 6 months ago
- infinite zoom effect extension for AUTOMATIC1111's webui - stable diffusion ☆673 · Updated last year
- Latent Consistency Model for AUTOMATIC1111 Stable Diffusion WebUI ☆614 · Updated last year
- A simple FastAPI Server to run XTTSv2 ☆513 · Updated 10 months ago
- A hub dedicated to development and upkeep of the Sytan SDXL workflow for ComfyUI ☆417 · Updated last year
- ☆1,733 · Updated 11 months ago
- Text-Prompted Generative Audio Model ☆91 · Updated 2 years ago
- Auto1111 extension implementing text2video diffusion models (like ModelScope or VideoCrafter) using only Auto1111 webui dependencies ☆1,318 · Updated 10 months ago
- singing voice change based on whisper, and lora for singing voice clone ☆637 · Updated last year
- Removes backgrounds from pictures. Extension for webui. ☆1,304 · Updated last year