NextAudioGen/ultimatevocalremover_api

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/NextAudioGen/ultimatevocalremover_api)

NextAudioGen / ultimatevocalremover_api

API for a Vocal Remover that uses Deep Neural Networks.

☆144

Alternatives and similar repositories for ultimatevocalremover_api

Users that are interested in ultimatevocalremover_api are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
RickyL-2000 / ROSVOT
View on GitHub
Robust Singing Voice Transcription and MIDI Extraction
☆123Nov 20, 2024Updated last year
ryota-komatsu / speaker_disentangled_hubert
View on GitHub
Official repository of the IEEE OJSP paper "Speaker-Disentangled Chunk-Wise Regression for Syllabic Tokenization"
☆46Jul 14, 2026Updated last week
Tencent / SongBench
View on GitHub
☆50Apr 30, 2026Updated 2 months ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
Eddycrack864 / UVR5-NO-UI
View on GitHub
Ultimate Vocal Remover CLI type for Google Colab
☆79Jun 26, 2026Updated 3 weeks ago
Recordtini / MusicSepGUI
View on GitHub
GUI for Music-Source-Separation-Training
☆24Feb 27, 2026Updated 4 months ago
ydqmkkx / Respiro-en
View on GitHub
Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…
☆44Sep 18, 2024Updated last year
exercise-book-yq / Supercodec
View on GitHub
☆51Mar 5, 2026Updated 4 months ago
AmphionTeam / SpeechJudge
View on GitHub
SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)
☆77Dec 23, 2025Updated 6 months ago
ishine / Mutiband-HIFIGAN
View on GitHub
Mutiband version of HIFIGAN
☆19Nov 6, 2020Updated 5 years ago
auspicious3000 / ProsodyLM
View on GitHub
ProsodyLM: Uncovering the Emerging Prosody Processing Capabilities in Speech Language Models
☆46Nov 18, 2025Updated 8 months ago
MichiyamaKaren / ultimatevocalremover-webui
View on GitHub
GUI for a Vocal Remover that uses Deep Neural Networks.
☆18Jan 18, 2024Updated 2 years ago
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
FlyToYourMooN / DDPM-Midi2Performance-Model
View on GitHub
Music generation
☆26May 2, 2024Updated 2 years ago
reppy4620 / vocoders
View on GitHub
My vocoder experiments
☆31Jul 26, 2025Updated 11 months ago
CNChTu / FCPE
View on GitHub
☆203Oct 14, 2025Updated 9 months ago
asuni / PitchSqueezer
View on GitHub
A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation
☆38Jan 17, 2024Updated 2 years ago
seanghay / uvr
View on GitHub
Ultimate Vocal Remover CLI
☆167Feb 5, 2025Updated last year
astradzhao / music-rfm
View on GitHub
Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…
☆40Oct 26, 2025Updated 8 months ago
aqtq314 / VogenSVS
View on GitHub
☆15Apr 16, 2026Updated 3 months ago
jamesparsloe / llm.speech
View on GitHub
Trying to build an all in one speech-text language model - a bit like GPT-4o
☆22Jun 1, 2024Updated 2 years ago
yxlllc / vocal-remover
View on GitHub
Vocal Remover using Deep Neural Networks
☆21Dec 31, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
ryota-komatsu / speech_resynth
View on GitHub
Speech Resynthesis and Language Modeling
☆27Jun 11, 2025Updated last year
Infinity-INF / fast-phasr
View on GitHub
Phonemes and durations labeling based on whisper small
☆11Jul 7, 2024Updated 2 years ago
archinetai / aligner-pytorch
View on GitHub
Sequence alignement methods with helpers for PyTorch.
☆24Nov 30, 2022Updated 3 years ago
innnky / MagVITS
View on GitHub
VITS with phoneme-level prosody modeling based on MaskGIT
☆85Aug 31, 2024Updated last year
redmist328 / APNet2
View on GitHub
Source code of APNet2, a vocoder
☆60Nov 23, 2023Updated 2 years ago
nomadkaraoke / python-audio-separator
View on GitHub
Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (pr…
☆1,292Updated this week
OpenNSP / Hifi-vaegan
View on GitHub
☆47Aug 31, 2024Updated last year
bovod-sjtu / HoliTok
View on GitHub
HoliTok:A Coutinuous Holistic Tokenization with Robust Dual Capabilities of Speech Generation and Understanding
☆36Jun 8, 2026Updated last month
mush42 / istft-onnx
View on GitHub
Export an ONNX graph that performs ISTFT. Designed for TTS models.
☆28Apr 23, 2024Updated 2 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
xinshengwang / robpitch
View on GitHub
A pitch detection model trained to be robust against noise and reverberation environments.
☆27Jan 21, 2025Updated last year
KimberleyJensen / Mel-Band-Roformer-Vocal-Model
View on GitHub
☆405Jan 12, 2025Updated last year
seanghay / uvr-mdx-infer
View on GitHub
Ultimate Vocal Remover Inference CLI
☆120Feb 27, 2026Updated 4 months ago
ZFTurbo / Music-Source-Separation-Training
View on GitHub
Repository for training models for music source separation.
☆1,446Jul 12, 2026Updated last week
bleugreen / deeprhythm
View on GitHub
fast, precise tempo prediction in python
☆70Feb 24, 2026Updated 4 months ago
winddori2002 / DEX-TTS
View on GitHub
DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability
☆108Jan 17, 2025Updated last year
7Xin / DPI-TTS
View on GitHub
☆13Sep 12, 2024Updated last year