API for a Vocal Remover that uses Deep Neural Networks.
☆137Jul 1, 2024Updated last year
Alternatives and similar repositories for ultimatevocalremover_api
Users who are interested in ultimatevocalremover_api are comparing it to the libraries listed below
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 10 months ago
- Robust Singing Voice Transcription and MIDI Extraction☆114Nov 20, 2024Updated last year
- Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"☆45Updated this week
- ☆44Oct 19, 2025Updated 5 months ago
- Ultimate Vocal Remover CLI type for Google Colab☆69Aug 16, 2025Updated 7 months ago
- Implementation of the paper "Variable Bitrate Residual Vector Quantization for Audio Coding"☆11Apr 10, 2025Updated 11 months ago
- ☆51Mar 5, 2026Updated 2 weeks ago
- GUI for Music-Source-Separation-Training☆22Feb 27, 2026Updated 3 weeks ago
- Multiband version of HIFIGAN☆19Nov 6, 2020Updated 5 years ago
- Official Repository of Paper: "Towards High-Quality Zero-Shot Singing Voice Conversion in Low-Resource Scenarios" (AAAI 2026)☆91Jan 31, 2026Updated last month
- unofficial pytorch implementation of HiFi-GAN with fast MISR.☆15Mar 21, 2023Updated 3 years ago
- Music generation☆25May 2, 2024Updated last year
- My vocoder experiments☆31Jul 26, 2025Updated 7 months ago
- Easy to use stem (e.g. instrumental/vocals) separation from CLI or as a python package, using a variety of amazing pre-trained models (pr…☆1,082Updated this week
- ☆190Oct 14, 2025Updated 5 months ago
- ☆15Aug 22, 2025Updated 6 months ago
- A robust pitch tracker using synchro-squeezed fft and frequency domain autocorrelation☆36Jan 17, 2024Updated 2 years ago
- Vocal Remover using Deep Neural Networks☆19Dec 31, 2024Updated last year
- Trying to build an all in one speech-text language model - a bit like GPT-4o☆22Jun 1, 2024Updated last year
- Ultimate Vocal Remover CLI☆160Feb 5, 2025Updated last year
- ☆338Jan 12, 2025Updated last year
- ☆55Dec 24, 2025Updated 2 months ago
- Speech Resynthesis and Language Modeling☆27Jun 11, 2025Updated 9 months ago
- AudioSR-Colab-Fork☆51Oct 12, 2025Updated 5 months ago
- Phonemes and durations labeling based on whisper small☆11Jul 7, 2024Updated last year
- Sequence alignment methods with helpers for PyTorch.☆24Nov 30, 2022Updated 3 years ago
- Official implementation of paper: Frame-Wise Breath Detection with Self-Training: An Exploration of Enhancing Breath Naturalness in Text-…☆40Sep 18, 2024Updated last year
- VITS with phoneme-level prosody modeling based on MaskGIT☆85Aug 31, 2024Updated last year
- ☆47Aug 31, 2024Updated last year
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- SpeechJudge: Towards Human-Level Judgment for Speech Naturalness (https://arxiv.org/abs/2511.07931)☆68Dec 23, 2025Updated 2 months ago
- Ultimate Vocal Remover Inference CLI☆110Feb 27, 2026Updated 3 weeks ago
- Repository for training models for music source separation.☆1,206Feb 4, 2026Updated last month
- Export an ONNX graph that performs ISTFT. Designed for TTS models.☆28Apr 23, 2024Updated last year
- [ACL 2024] Generative Pre-Trained Speech Language Model with Efficient Hierarchical Transformer☆69Nov 1, 2024Updated last year
- Voice activity detection and speaker gender segmentation audiovisual corpus☆16Jan 20, 2025Updated last year
- fast, precise tempo prediction in python☆65Feb 24, 2026Updated 3 weeks ago
- DEX-TTS: Diffusion-based EXpressive TTS with Style Modeling on Time Variability☆107Jan 17, 2025Updated last year