Eddycrack864 / UVR5-UI
Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models
☆596 · Oct 18, 2025 · Updated 3 months ago
Alternatives and similar repositories for UVR5-UI
Users who are interested in UVR5-UI are comparing it to the repositories listed below.
- Ultimate Vocal Remover CLI type for Google Colab ☆66 · Aug 16, 2025 · Updated 5 months ago
- Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a Python package, using a variety of amazing pre-trained models (pr… ☆1,018 · Jan 24, 2026 · Updated 3 weeks ago
- Performs the entire AI cover generation process with UI ☆29 · Aug 4, 2025 · Updated 6 months ago
- A toolkit for speaker diarization. ☆392 · Updated this week
- Codename's rvc fork version 3, based on Applio. ☆37 · Aug 2, 2025 · Updated 6 months ago
- An AI focused photo manipulation tool based on Gradio ☆182 · Jun 28, 2025 · Updated 7 months ago
- Ultimate Vocal Remover for Google Colab ☆56 · Oct 20, 2024 · Updated last year
- Gradio WebUI for whisper, faster-whisper, whisper-timestamped. Supports YouTube Downloader, Vocal Remover and Transcription. ☆66 · Oct 2, 2025 · Updated 4 months ago
- Generate 3D meshes from a single 2D image using TripoSR, complete with manual geometry editing and texture baking support ☆56 · Oct 17, 2024 · Updated last year
- Official implementation of DisEnvisioner: Disentangled and Enriched Visual Prompt for Customized Image Generation ☆119 · Jan 23, 2025 · Updated last year
- The best OSS video generation models ☆135 · Oct 24, 2024 · Updated last year
- Inference server for MioTTS, a lightweight and fast LLM-based TTS model. ☆54 · Updated this week
- Ultimate Vocal Remover CLI ☆157 · Feb 5, 2025 · Updated last year
- Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (… ☆18 · Sep 12, 2023 · Updated 2 years ago
- Official implementation of "Real-SRGD: Enhancing Real-World Image Super-Resolution with Classifier-Free Guided Diffusion" [ACCV2024] ☆18 · Dec 9, 2024 · Updated last year
- ☆14 · Jun 23, 2024 · Updated last year
- ☆15 · Jan 21, 2025 · Updated last year
- Colab inference for ZFTurbo's Music-Source-Separation-Training ☆58 · Jan 27, 2026 · Updated 2 weeks ago
- ComfyUI Custom Nodes for "TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching". This generates high-quality 44… ☆104 · Mar 28, 2025 · Updated 10 months ago
- A simple, high-quality voice conversion tool focused on ease of use and performance. ☆2,947 · Jan 31, 2026 · Updated 2 weeks ago
- A diffusers pipeline for zero shot stylised couples portrait creation ☆100 · Dec 10, 2024 · Updated last year
- An official implementation of SwapAnyone. ☆74 · Mar 14, 2025 · Updated 11 months ago
- Detect and extract tables to markdown and csv ☆754 · Jan 24, 2025 · Updated last year
- Open source inference code for Rev's model ☆435 · Apr 22, 2025 · Updated 9 months ago
- Jupyter notebooks for PuLID face transfer with Flux.1 dev. Able to run on Google Colab Free Tier ☆18 · Dec 18, 2024 · Updated last year
- AudioSR-Colab-Fork ☆51 · Oct 12, 2025 · Updated 4 months ago
- Text-Guided Generation of Full-Body Image with Preserved Reference Face for Customized Animation ☆24 · Jun 24, 2024 · Updated last year
- ☆33 · Aug 9, 2024 · Updated last year
- GUI for a Vocal Remover that uses Deep Neural Networks. ☆23,618 · Mar 13, 2025 · Updated 11 months ago
- Website source code for our ACM MM'23 paper "Hierarchical Masked 3D Diffusion Model for Video Outpainting". ☆41 · Apr 20, 2024 · Updated last year
- Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with… ☆6,245 · Dec 5, 2025 · Updated 2 months ago
- Memory-Guided Diffusion for Expressive Talking Video Generation ☆173 · Mar 9, 2025 · Updated 11 months ago
- ☆16 · Apr 23, 2024 · Updated last year
- Repository for training models for music source separation. ☆1,152 · Feb 4, 2026 · Updated last week
- [CVPR'25] Official Implementations for Paper - AniDoc: Animation Creation Made Easier ☆567 · Apr 15, 2025 · Updated 10 months ago
- Gradio WebUI for AdvancedLivePortrait ☆526 · Mar 13, 2025 · Updated 11 months ago
- Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching" ☆14,079 · Updated this week
- OpenMusic: SOTA Text-to-music (TTM) Generation ☆635 · Jun 26, 2025 · Updated 7 months ago
- The BEST music separation model with help of A.I. ... to my ears ! 👂👂 ☆147 · Jun 10, 2024 · Updated last year