yxlllc / vocal-remover
Vocal Remover using Deep Neural Networks
☆16 Updated last month
Alternatives and similar repositories for vocal-remover:
Users who are interested in vocal-remover are comparing it to the libraries listed below:
- ☆38 Updated 5 months ago
- Implementation of RIFT-SVC, a singing voice conversion model based on Rectified Flow Transformer. ☆31 Updated last month
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion ☆48 Updated 2 weeks ago
- ☆39 Updated last year
- ☆63 Updated last year
- Bilingual Singing Voice Synthesis ☆18 Updated 10 months ago
- ☆24 Updated last year
- SOFA_AI: Singing-Oriented Forced Aligner for Automatic Inference ☆21 Updated 8 months ago
- Vocoder NSF-HiFiGAN (Moved into deepaudio) ☆50 Updated 2 years ago
- The source code for the paper XiaoiceSing2 (Interspeech 2023) ☆47 Updated last year
- ONNX deployment of the CREPE pitch tracker ☆20 Updated 2 years ago
- PitchVC: Pitch Conditioned Any-to-Many Voice Conversion ☆34 Updated 8 months ago
- HiFTNet wav/audio super-resolution 16/24 kHz to 48 kHz ☆24 Updated last year
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes ☆11 Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT ☆81 Updated 5 months ago
- dog-can-sing-song ☆20 Updated 3 months ago
- ☆14 Updated last week
- ☆13 Updated last year
- AudioSR-Upsampling (any -> 48 kHz) ☆38 Updated last year
- Multispeaker Community Vocoder Model for DiffSinger ☆35 Updated 9 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE ☆63 Updated 10 months ago
- ☆22 Updated last year
- [NAACL 2025] WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow Matching ☆36 Updated this week
- Source code of EfficientTTS 2 ☆12 Updated last year
- Singing voice conversion based on Glow-TTS ☆11 Updated last year
- Reimplementation of Miipher ☆20 Updated last year
- ☆44 Updated last year
- A minimum inference engine for DiffSinger ☆34 Updated 10 months ago
- Conditional Variational Auto-Encoder with Jointly Training FastSpeech2(+Conformer) and HiFi-GAN for End to End Text to Speech ☆46 Updated 2 years ago