yxlllc / vocal-remover
Vocal Remover using Deep Neural Networks
☆17 Updated 5 months ago
Alternatives and similar repositories for vocal-remover
Users who are interested in vocal-remover are comparing it to the repositories listed below.
- ☆14 Updated 2 months ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer. ☆47 Updated 2 months ago
- ☆40 Updated 9 months ago
- ☆66 Updated last year
- ☆24 Updated 2 years ago
- singing voice conversion based on glow-tts ☆11 Updated last year
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion ☆66 Updated 3 weeks ago
- Hubert-based Forced Aligner ☆11 Updated this week
- only rmvpe ☆22 Updated last year
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference ☆21 Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT ☆81 Updated 9 months ago
- ☆39 Updated last year
- MFA acoustic model training based on Opencpop ☆15 Updated 2 years ago
- singing voice conversion without f0 ☆22 Updated 2 years ago
- Vocoder NSF-HiFiGAN (Moved into deepaudio) ☆53 Updated 2 years ago
- Incremental Disentanglement for Environment-Aware Zero-Shot Text-to-Speech Synthesis ☆26 Updated 2 months ago
- Singing voice conversion based on FreeVC ☆21 Updated 2 years ago
- A Chinese version of A Neural Parametric Singing Synthesizer ☆12 Updated 3 years ago
- Music generation ☆24 Updated last year
- A GUI Toolkit for SVS Label Generation. Heavily utilizes SOFA & Whisper to generate htk-style force-aligned labels with a focus on singin… ☆33 Updated 9 months ago
- AudioSR-Upsampling (any -> 48kHz) ☆41 Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments. ☆25 Updated 4 months ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆34 Updated 11 months ago
- Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025 ☆28 Updated 3 months ago
- Multispeaker Community Vocoder Model for DiffSinger ☆37 Updated 3 weeks ago
- Unofficial Pytorch implementation of SNAC: Speaker-normalized affine coupling layer in flow-based architecture for zero-shot multi-speake… ☆57 Updated last year
- dog-can-sing-song ☆27 Updated 3 weeks ago
- ☆29 Updated last year
- ☆13 Updated last year
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE ☆71 Updated last year