yang123qwe/vocal_separation_by_uvr5

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/yang123qwe/vocal_separation_by_uvr5)

yang123qwe / vocal_separation_by_uvr5

基于uvr5的歌唱人声分离

☆30

Alternatives and similar repositories for vocal_separation_by_uvr5

Users that are interested in vocal_separation_by_uvr5 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mtz1992 / spleeter
View on GitHub
人声背景声分离
☆13Sep 22, 2020Updated 5 years ago
saurjya / EnsembleSep
View on GitHub
This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.
☆12Nov 7, 2024Updated last year
y-chan / hifi-gan-misrnet
View on GitHub
unofficial pytorch implementation of HiFi-GAN with fast MISR.
☆15Mar 21, 2023Updated 3 years ago
swagger-coder / visinger_lab
View on GitHub
为visinger SVS系统写的展示系统～本质仍然是个音乐播放器
☆11Apr 18, 2023Updated 3 years ago
innnky / VISinger2-nomidi
View on GitHub
☆24Apr 10, 2023Updated 3 years ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
lucasjinreal / textfrontend
View on GitHub
单独维护的中文TTS
☆34Oct 28, 2022Updated 3 years ago
hanshounsu / d3rm
View on GitHub
☆14Feb 3, 2026Updated 5 months ago
Infinity-INF / fast-phasr
View on GitHub
Phonemes and durations labeling based on whisper small
☆11Jul 7, 2024Updated 2 years ago
FlyToYourMooN / DDPM-Midi2Performance-Model
View on GitHub
Music generation
☆26May 2, 2024Updated 2 years ago
NanKeRen2020 / UVR5_Linux
View on GitHub
ultimate vocal remover application run on linux ubuntu1604
☆59Mar 20, 2023Updated 3 years ago
xinghyfish / suda-daily-health-report
View on GitHub
苏州大学每日健康情况自动化打卡脚本
☆13Mar 30, 2022Updated 4 years ago
xinghyfish / live-stream-classroom
View on GitHub
综合项目实践项目学习记录+代码
☆11Jun 18, 2022Updated 4 years ago
huutuongtu / Lightvoc
View on GitHub
LIGHTVOC AN UPSAMPLING-FREE GAN VOCODER BASED ON CONFORMER AND INVERSE SHORT-TIME FOURIER TRANSFORM
☆18May 17, 2024Updated 2 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
mbrotos / SoundSeg
View on GitHub
Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation
☆13Feb 18, 2026Updated 5 months ago
aqtq314 / VogenSVS
View on GitHub
☆15Apr 16, 2026Updated 3 months ago
Liyulingyue / DesktopPet
View on GitHub
一个桌面宠物程序，现在似乎发展成为桌面便签了。桌面便签程序见develop-todolist分支。
☆11Nov 17, 2024Updated last year
GeWanying / shap-anti-spoofing
View on GitHub
This repository includes the code to reproduce our paper [Explainable deepfake and spoofing detection: an attack analysis using SHapley A…
☆12Jan 24, 2024Updated 2 years ago
HHousen / speaker-change-detection
View on GitHub
Speaker change detection using SincNet and an LSTM/Transformer
☆57May 26, 2025Updated last year
york135 / CTC_CE_for_AST
View on GitHub
The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…
☆12Mar 25, 2025Updated last year
archinetai / aligner-pytorch
View on GitHub
Sequence alignement methods with helpers for PyTorch.
☆24Nov 30, 2022Updated 3 years ago
fss1t / CausalStarGANv2-VC
View on GitHub
☆22Apr 4, 2023Updated 3 years ago
So-Fann / VISinger
View on GitHub
☆55Aug 11, 2022Updated 3 years ago
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
zaocan666 / DyViSE
View on GitHub
Dynamic vision-guided speaker embedding for audio-visual speaker diarization
☆12Jul 5, 2022Updated 4 years ago
ishine / Mutiband-HIFIGAN
View on GitHub
Mutiband version of HIFIGAN
☆19Nov 6, 2020Updated 5 years ago
ryota-komatsu / speech_resynth
View on GitHub
Speech Resynthesis and Language Modeling
☆27Jun 11, 2025Updated last year
jamesparsloe / llm.speech
View on GitHub
Trying to build an all in one speech-text language model - a bit like GPT-4o
☆22Jun 1, 2024Updated 2 years ago
RS2002 / PianoBart
View on GitHub
[ICME 2024 oral] Official Repository for The Paper, PianoBART: Symbolic Piano Music Understanding and Generating with Large-Scale Pre-Tra…
☆23Aug 17, 2025Updated 11 months ago
innnky / FreeSVC
View on GitHub
基于FreeVC的歌声转换
☆21Dec 16, 2022Updated 3 years ago
anton-kashkin / hifi_vc
View on GitHub
☆25Jan 24, 2023Updated 3 years ago
meaningTeam / tidy-tunes
View on GitHub
Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …
☆23May 19, 2026Updated 2 months ago
aiyoudiao / robot-four-a
View on GitHub
这是一个拥有四端的微信机器人应用程序，浏览器客户端(React 全家桶 + Ant Design UI)、监听服务端(TypeScript + Typeorm + RabbitMQ + **Wechaty** + Koa2)、存储服务端(TypeScript + Typeo…
☆12Dec 11, 2022Updated 3 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ishine / LangSegment
View on GitHub
It is a multi-lingual (97 languages) text content automatic recognition and segmentation tool.它是一个TTS多语言（97种语言）的混合文本内容自动识别和拆分工具。
☆23Feb 20, 2024Updated 2 years ago
yoongi43 / VRVQ
View on GitHub
Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"
☆11Apr 10, 2025Updated last year
yucongzh / online_speaker_diarization
View on GitHub
☆15Jul 11, 2022Updated 4 years ago
nikvaessen / w2v2-speaker-few-samples
View on GitHub
Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688
☆13Dec 2, 2024Updated last year
lucadellalib / discrete-wavlm-codec
View on GitHub
A neural speech codec based on discrete WavLM representations
☆26Aug 28, 2024Updated last year
topel / audioset-convnext-inf
View on GitHub
Adapting a ConvNeXt model to audio classification on AudioSet
☆27Feb 19, 2025Updated last year
lifeiteng / SoundStorm
View on GitHub
☆71Jul 13, 2023Updated 3 years ago