Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)
☆1,039Jan 24, 2026Updated last month
Alternatives and similar repositories for python-audio-separator
Users that are interested in python-audio-separator are comparing it to the libraries listed below
Sorting:
- Ultimate Vocal Remover CLI☆158Feb 5, 2025Updated last year
- Repository for training models for music source separation.☆1,165Feb 4, 2026Updated 3 weeks ago
- Ultimate Vocal Remover CLI type for Google Colab☆67Aug 16, 2025Updated 6 months ago
- Ultimate Vocal Remover 5 with Gradio UI. Separate an audio file into various stems, using multiple models☆604Oct 18, 2025Updated 4 months ago
- Model for MDX23 music separation contest☆821Apr 5, 2025Updated 10 months ago
- ☆329Jan 12, 2025Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆91Jul 23, 2025Updated 7 months ago
- Automatically create synchronised lyrics files in ASS and LRC with word-level timestamps, using Whisper and lyrics from online sources, w…☆89Jan 19, 2026Updated last month
- Ultimate Vocal Remover Inference CLI☆109Updated this week
- Vocal Remover using Deep Neural Networks☆1,744Jul 23, 2024Updated last year
- BandIt: Cinematic Audio Source Separation☆154Jul 29, 2025Updated 7 months ago
- GUI for a Vocal Remover that uses Deep Neural Networks.☆23,757Mar 13, 2025Updated 11 months ago
- GUI for Music-Source-Separation-Training☆22Updated this week
- Versatile audio super resolution (any -> 48kHz) with AudioSR.☆1,765Aug 27, 2025Updated 6 months ago
- LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM☆18May 17, 2024Updated last year
- Attempt to fully automate the creation of karaoke music videos, using open source tools and AI (e.g. Whisper & MDX-Net)☆61Jul 2, 2025Updated 7 months ago
- 🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!☆227Jul 12, 2025Updated 7 months ago
- Next-generation TTS model using flow-matching and DiT, inspired by Stable Diffusion 3☆434Sep 13, 2024Updated last year
- Music repair method to convert lossy MP3 compressed music to lossless music.☆357Aug 12, 2025Updated 6 months ago
- Sound Demixing Challenge 2023☆95Jul 14, 2023Updated 2 years ago
- Unofficial PyTorch implementation of Music Source Separation with Band-split RNN☆187Jun 10, 2024Updated last year
- a lightweight voice conversion☆86Sep 2, 2024Updated last year
- ☆19Mar 22, 2024Updated last year
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆52Jul 29, 2025Updated 7 months ago
- SOFA: Singing-Oriented Forced Aligner☆208May 16, 2025Updated 9 months ago
- Official implementation of "Separate Anything You Describe"☆1,875Nov 26, 2024Updated last year
- Application Data☆12Mar 6, 2025Updated 11 months ago
- Unofficial implementation of ConvNeXt-TTS powered by lightning☆18Oct 20, 2024Updated last year
- AI powered speech denoising and enhancement☆2,198Dec 3, 2024Updated last year
- The BEST music separation model with help of A.I. ... to my ears ! 👂👂☆147Jun 10, 2024Updated last year
- Colab adaptation of MVSep Model for MDX23 music separation contest☆328Sep 25, 2024Updated last year
- All-In-One Music Structure Analyzer☆721May 9, 2024Updated last year
- An AI-Powered Speech Processing Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Enhancement, Separation, and Target Spe…☆3,936Aug 14, 2025Updated 6 months ago
- API for a Vocal Remover that uses Deep Neural Networks.☆137Jul 1, 2024Updated last year
- This is an implementation for train hifigan part of XTTSv2 model using Coqui/TTS.☆87Nov 12, 2024Updated last year
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer.☆56Nov 10, 2025Updated 3 months ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆54Oct 31, 2023Updated 2 years ago
- KUIELAB-MDX-Net got the 2nd place on the Leaderboard A and the 3rd place on the Leaderboard B in the MDX-Challenge ISMIR 2021☆231Feb 27, 2023Updated 3 years ago
- PromptTTS++: Controlling Speaker Identity in Prompt-Based Text-To-Speech Using Natural Language Descriptions☆83Oct 11, 2024Updated last year