Singing vocal separation based on uvr5
☆28Nov 5, 2022Updated 3 years ago
Alternatives and similar repositories for vocal_separation_by_uvr5
Users who are interested in vocal_separation_by_uvr5 are comparing it to the libraries listed below.
- This branch of Asteroid contains code for the vocal harmony and chamber ensemble separation related papers.☆12Nov 7, 2024Updated last year
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- A demo system written for the visinger SVS system; in essence it is still a music player☆11Apr 18, 2023Updated 2 years ago
- ☆24Apr 10, 2023Updated 2 years ago
- [ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder☆13Mar 11, 2025Updated last year
- Spleeter C++ dynamic library and executable for separating vocals and accompaniment in music☆26Dec 26, 2022Updated 3 years ago
- Separately maintained Chinese TTS☆34Oct 28, 2022Updated 3 years ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- Music generation☆25May 2, 2024Updated last year
- ultimate vocal remover application run on linux ubuntu1604☆57Mar 20, 2023Updated 3 years ago
- E-commerce platform with a service-oriented architecture☆10Jun 21, 2022Updated 3 years ago
- Tidy Tunes is an easy-to-use pipeline for mining high-quality audio data for speech generation models. To do so, it chains multiple open …☆23Mar 17, 2026Updated last week
- Spectral Mapping of Singing Voices: U-Net-Assisted Vocal Segmentation☆13Feb 18, 2026Updated last month
- ☆15Aug 22, 2025Updated 7 months ago
- Speaker change detection using SincNet and an LSTM/Transformer☆58May 26, 2025Updated 10 months ago
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- ☆22Apr 4, 2023Updated 2 years ago
- The official repo/implementation of the paper "Training a Singing Transcription Model Using Connectionist Temporal Classification Loss an…☆12Mar 25, 2025Updated last year
- Automated daily health check-in script for Soochow University☆13Mar 30, 2022Updated 3 years ago
- Study notes and code for a comprehensive project practice course☆11Jun 18, 2022Updated 3 years ago
- ☆55Aug 11, 2022Updated 3 years ago
- Multiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆12Jul 5, 2022Updated 3 years ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 9 months ago
- g2p for english tts☆19Nov 10, 2022Updated 3 years ago
- A desktop pet program that now seems to have evolved into a desktop sticky-notes app. For the sticky-notes program, see the develop-todolist branch.☆11Nov 17, 2024Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated last year
- Singing voice conversion based on FreeVC☆21Dec 16, 2022Updated 3 years ago
- ☆25Jan 24, 2023Updated 3 years ago
- [ICME 2024] Official Repository for The Paper, PianoBART: Symbolic Piano Music Understanding and Generating with Large-Scale Pre-Training☆22Aug 17, 2025Updated 7 months ago
- Pornographic image detection based on NSFW Model; video detection to be added later☆32Jun 17, 2022Updated 3 years ago
- Inference code for Interspeech 2025 paper, "LSCodec: Low-Bitrate and Speaker-Decoupled Discrete Speech Codec"☆35Oct 23, 2025Updated 5 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- Ai吟美 desktop pet [c#/c++/spout2/directX]☆13Dec 5, 2024Updated last year
- ☆31Jul 16, 2025Updated 8 months ago
- Official PyTorch implementation of “MaskRIS: Semantic Distortion-aware Data Augmentation for Referring Image Segmentation”☆18Dec 5, 2024Updated last year
- A neural speech codec based on discrete WavLM representations☆26Aug 28, 2024Updated last year
- PFCC community blog☆14Updated this week
- Research code for the paper "Training speaker recognition systems with limited data" at https://arxiv.org/abs/2203.14688☆13Dec 2, 2024Updated last year