nomadkaraoke / python-audio-separator
Easy to use vocal separation from CLI or as a python package, using a variety of amazing pre-trained models (primarily from UVR)
☆407Updated this week
Related projects: ⓘ
- Repository for training models for music source separation.☆379Updated this week
- Ultimate Vocal Remover CLI☆103Updated 6 months ago
- Implementation of Band Split Roformer, SOTA Attention network for music source separation out of ByteDance AI Labs☆395Updated last month
- Model for MDX23 music separation contest☆604Updated 2 months ago
- Colab adaptation of MVSep Model for MDX23 music separation contest☆253Updated last month
- ☆203Updated 7 months ago
- singing voice change based on whisper, and lora for singing voice clone☆617Updated 10 months ago
- in preparation...☆254Updated 2 months ago
- General Speech Restoration☆999Updated 3 months ago
- FreeVC: Towards High-Quality Text-Free One-Shot Voice Conversion☆589Updated 5 months ago
- ☆399Updated 3 weeks ago
- Python script that slices audio with silence detection☆754Updated 3 months ago
- 🚀 RVC + UVR = A perfect set of tools for voice cloning, easily and free!☆122Updated last week
- Easily train a good VC model with voice data <= 10 mins!☆114Updated 2 weeks ago
- VISinger 2: High-Fidelity End-to-End Singing Voice Synthesis Enhanced by Digital Signal Processing Synthesizer☆309Updated 2 months ago
- unofficial vits2-TTS implementation in pytorch☆472Updated 5 months ago
- model_repo☆100Updated last year
- Preprocess Audio for training☆223Updated last month
- Vocos: Closing the gap between time-domain and Fourier-based neural vocoders for high-quality audio synthesis☆763Updated last month
- 🔊 Create labeled datasets, enhance audio quality, identify speakers, support diverse dataset types. 🎧👥📊 Advanced audio processing.☆194Updated 3 months ago
- ☆163Updated last month
- Audio Slicer that uses silence detection to split .wav audio files into multiple .wav samples.☆288Updated 4 months ago
- An easy to understand TTS / SVS / SVC framework☆623Updated last month
- Text to Speech using Coqui TTS + RVC☆87Updated 6 months ago
- AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI mo…☆462Updated 2 weeks ago
- AI powered speech denoising and enhancement☆1,277Updated 2 months ago
- Versatile audio super resolution (any -> 48kHz) with AudioSR.☆1,083Updated 4 months ago
- VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design☆462Updated last year
- API for a Vocal Remover that uses Deep Neural Networks.☆75Updated 2 months ago
- QuickVC: Any-to-many Voice Conversion Using Inverse Short-time Fourier Transform for Faster Conversion☆221Updated last year