NextAudioGen / ultimatevocalremover_api
API for a Vocal Remover that uses Deep Neural Networks.
☆97Updated 7 months ago
Alternatives and similar repositories for ultimatevocalremover_api:
Users that are interested in ultimatevocalremover_api are comparing it to the libraries listed below
- Diffusion Singing Voice Conversion based on Grad-TTS from HuaWei☆141Updated last year
- CoMoSVC: One-Step Consistency Model Based Singing Voice Conversion & Singing Voice Clone☆135Updated 10 months ago
- ☆235Updated last year
- SOFA: Singing-Oriented Forced Aligner☆148Updated last week
- Unofficial implementation of NaturalSpeech2 for Voice Conversion and Text to Speech☆234Updated 11 months ago
- ☆201Updated 2 years ago
- ☆63Updated last year
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆128Updated last year
- Singing Voice Synthesis based on VITS, different from VISinger☆188Updated last year
- Diffusion-based singing voice pitch correction☆102Updated 5 months ago
- ☆127Updated last month
- ☆71Updated 4 months ago
- ☆95Updated 2 weeks ago
- Ultimate Vocal Remover CLI type for Google Colab☆50Updated last month
- text to speech using autoregressive transformer and VITS☆234Updated 10 months ago
- Official Implementation of StyleTTS-VC☆175Updated last month
- a text-conditional diffusion probabilistic model capable of generating high fidelity audio.☆144Updated 8 months ago
- Implementation of Prompt-Singer: Controllable Singing-Voice-Synthesis with Natural Language Prompt (NAACL'24).☆102Updated 3 weeks ago
- ☆65Updated last year
- Train the next generation of TTS systems.☆162Updated 5 months ago
- ACM MM 2023 CoMoSpeech: One-Step Speech and Singing Voice Synthesis via Consistency Model☆201Updated 9 months ago
- Ultimate Vocal Remover Inference CLI☆54Updated 2 weeks ago
- TTSAudioNormalizer is a specialized tool for TTS data production, featuring descriptive statistical analysis of audio loudness and loud…☆91Updated 2 months ago
- ChatTTS is a generative speech model for daily dialogue.☆21Updated last month
- An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io☆67Updated last year
- ☆123Updated last week
- VITS with phoneme-level prosody modeling based on MaskGIT☆81Updated 5 months ago
- ☆73Updated 2 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆80Updated 10 months ago
- Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.☆35Updated last year