This is a demo of SOTA vocal separation models. Upload an audio file and the model will separate the vocals from the background music. Based on the MDX23 results, the current SOTA model is BS-RoFormer.
☆18 Jul 25, 2024 Updated last year
Alternatives and similar repositories for vocal-separation
Users that are interested in vocal-separation are comparing it to the libraries listed below.
- ☆10 Jun 6, 2023 Updated 3 years ago
- ☆22 Feb 27, 2026 Updated 4 months ago
- Voice data <= 10 mins can also be used to train a good VC model! ☆15 Oct 1, 2024 Updated last year
- Program that enables seamless interaction with your documents through an advanced vector database and the power of Large Language Model (… ☆18 Sep 12, 2023 Updated 2 years ago
- Code associated with the paper: CTC-DRO: Robust Optimization for Reducing Language Disparities in Speech Recognition. ☆17 May 16, 2025 Updated last year
- Advanced RVC Inference for quicker and effortless model downloads ☆78 Jun 24, 2026 Updated last week
- ☆13 Apr 26, 2026 Updated 2 months ago
- ☆10 Apr 16, 2024 Updated 2 years ago
- A simple UTAU voicebank recorder app for Android. ☆16 Jun 21, 2026 Updated last week
- Scrapes WeChat official account articles; updated to Qt 6 for macOS compatibility. ☆11 Sep 26, 2023 Updated 2 years ago
- A voice conversion tool project for Vietnamese users. ☆32 Jun 15, 2026 Updated 2 weeks ago
- THIS VERSION IS DEPRECATED, CHECK OUT THE NEW REPO: https://github.com/JoaTH-Team/Rhythmo ☆15 Dec 2, 2025 Updated 7 months ago
- WhatsApp bot that uses a pairing code. ☆11 Jun 6, 2024 Updated 2 years ago
- A WebUI to create speech to speech with any RVC v2 trained AI voice ☆28 Jun 28, 2024 Updated 2 years ago
- Facial Landmark labels for Manga 109 dataset ☆12 Nov 21, 2019 Updated 6 years ago
- ☆15 Aug 27, 2025 Updated 10 months ago
- A modification of Psych Engine with some twists. ☆13 Sep 2, 2023 Updated 2 years ago
- For the prebuilt distribution of ZoiaPatchViewer ☆10 Jan 5, 2023 Updated 3 years ago
- Performs the entire AI cover generation process with UI ☆30 Aug 4, 2025 Updated 11 months ago
- ☆25 Oct 24, 2025 Updated 8 months ago
- A real KinitoPET for your desktop ☆22 Mar 4, 2024 Updated 2 years ago
- A vocal source separation ☆42 Feb 2, 2025 Updated last year
- WhatsApp BOT RPG ☆15 Oct 29, 2022 Updated 3 years ago
- Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability. ☆20 Mar 12, 2026 Updated 3 months ago
- Various annotations of Manga109 dataset ☆13 Apr 23, 2025 Updated last year
- Fetches bilibili video information, including information for all of an uploader's videos. ☆14 Apr 24, 2026 Updated 2 months ago
- This is the repository for the work "BridgeVoC: Revitalizing Neural Vocoder from a Restoration Perspective". ☆66 Nov 5, 2025 Updated 7 months ago
- A converter from Arpabet to IPA (see https://en.wikipedia.org/wiki/Arpabet) ☆17 Jan 2, 2018 Updated 8 years ago
- Singing voice conversion based on FreeVC. ☆21 Dec 16, 2022 Updated 3 years ago
- Pinokio System Programming ☆36 Dec 19, 2024 Updated last year
- Modern, flexible, observable, testable app preferences written in Swift. ☆19 Jun 22, 2026 Updated last week
- Grapheme-to-phoneme (G2P) conversion is the process of generating pronunciation for words based on their written form. It has a highly es… ☆19 Jun 14, 2021 Updated 5 years ago
- This project is derived from zipEnhancer, the speech denoising model open-sourced by Alibaba. ☆41 May 8, 2026 Updated last month
- Far Cry is a first-person shooter (FPS) video game with amazing graphics, developed by Crytek and published by Ubisoft. ☆13 May 28, 2019 Updated 7 years ago
- [NeurIPS 2025] Separate Anything in Audio with Zero Training ☆59 Nov 3, 2025 Updated 8 months ago
- Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs ☆855 Jun 14, 2026 Updated 2 weeks ago
- ☆15 Jun 17, 2024 Updated 2 years ago
- Training code for FAN ☆19 Apr 11, 2022 Updated 4 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment) ☆126 Jun 16, 2022 Updated 4 years ago