Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models
☆608Oct 18, 2025Updated 4 months ago
Alternatives and similar repositories for UVR5-UI
Users that are interested in UVR5-UI are comparing it to the libraries listed below
Sorting:
- Ultimate Vocal Remover CLI type for Google Colab☆68Aug 16, 2025Updated 6 months ago
- Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (pr…☆1,049Jan 24, 2026Updated last month
- Performs the entire AI cover generation process with UI☆30Aug 4, 2025Updated 7 months ago
- A toolkit for speaker diarization.☆406Feb 9, 2026Updated 3 weeks ago
- Codename's rvc fork version 3, based on Applio.☆37Aug 2, 2025Updated 7 months ago
- An AI focused photo manipulation tool based on Gradio☆182Jun 28, 2025Updated 8 months ago
- Ultimate Vocal Remover for Google Colab☆56Oct 20, 2024Updated last year
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription.☆67Oct 2, 2025Updated 5 months ago
- Generate 3D meshes from a single 2D image using TripoSR, complete with manual geometry editing and texture baking support☆56Oct 17, 2024Updated last year
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation☆120Jan 23, 2025Updated last year
- The best OSS video generation models☆135Oct 24, 2024Updated last year
- Ultimate Vocal Remover CLI☆159Feb 5, 2025Updated last year
- A simple, high-quality voice conversion tool focused on ease of use and performance.☆3,032Updated this week
- ☆15Jan 21, 2025Updated last year
- Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (…☆18Sep 12, 2023Updated 2 years ago
- ☆14Jun 23, 2024Updated last year
- Colab inference for ZFTurbo's Music-Source-Separation-Training☆59Jan 27, 2026Updated last month
- ComfyUI Custom Nodes for "TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching". This generates high-quality 44…☆104Mar 28, 2025Updated 11 months ago
- A diffusers pipeline for zero shot stylised couples portrait creation☆100Dec 10, 2024Updated last year
- An official implementation of SwapAnyone.☆74Mar 14, 2025Updated 11 months ago
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with…☆6,342Dec 5, 2025Updated 3 months ago
- Detect and extract tables to markdown and csv☆754Jan 24, 2025Updated last year
- Open source inference code for Rev's model☆434Apr 22, 2025Updated 10 months ago
- GUI for a Vocal Remover that uses Deep Neural Networks.☆23,816Mar 13, 2025Updated 11 months ago
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier☆18Dec 18, 2024Updated last year
- Official implementation of "Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion" [ACCV2024]☆18Dec 9, 2024Updated last year
- AudioSR-Colab-Fork☆51Oct 12, 2025Updated 4 months ago
- ☆33Aug 9, 2024Updated last year
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation☆24Jun 24, 2024Updated last year
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting".☆41Apr 20, 2024Updated last year
- Memory-Guided Diffusion for Expressive Talking Video Generation☆173Mar 9, 2025Updated 11 months ago
- ☆16Apr 23, 2024Updated last year
- [CVPR'25] Official Implementations for Paper - AniDoc: Animation Creation Made Easier☆569Apr 15, 2025Updated 10 months ago
- gradio WebUI for AdvancedLivePortrait☆527Mar 13, 2025Updated 11 months ago
- Repository for training models for music source separation.☆1,182Feb 4, 2026Updated last month
- An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…☆3,951Aug 14, 2025Updated 6 months ago
- OpenMusic: SOTA Text-to-music (TTM) Generation☆633Jun 26, 2025Updated 8 months ago
- The BEST music separation model with help of A.I. ... to my ears ! 👂👂☆147Jun 10, 2024Updated last year
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"☆14,169Updated this week