jerryuhoo / VISingerLinks
Use VITS and Opencpop to develop singing voice synthesis; Different from VISinger.
☆35Updated 2 years ago
Alternatives and similar repositories for VISinger
Users that are interested in VISinger are comparing it to the libraries listed below
Sorting:
- a lightweight voice conversion☆85Updated last year
- Monotonic Alignment Search☆100Updated 5 months ago
- Unsupervised WaveNet-based Singing Voice Conversion Using Pitch Augmentation and Two-phase Approach☆70Updated 3 years ago
- Barkify: an unoffical training implementation of Bark TTS by suno-ai☆128Updated 2 years ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆87Updated 4 months ago
- Singing Voice Synthesis based on VITS, different from VISinger☆192Updated 2 years ago
- Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…☆32Updated last year
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆139Updated 3 years ago
- Deep Neural Pitch Extractor for Voice Conversion and TTS Training☆142Updated 3 years ago
- BigVGAN with Neural Source-Filter☆56Updated 2 years ago
- AudioSR-Upsampling (any -> 48kHz)☆43Updated last year
- An High-resolution implementation of HiFi-GAN Vocoder for Voice Conversion.☆32Updated 2 years ago
- X-E-Speech: Joint Training Framework of Non-Autoregressive Cross-lingual Emotional Text-to-Speech and Voice Conversion☆109Updated last year
- Style-Controllable Zero-Shot Text to Speech Synthesizer based on VALL-E☆135Updated last year
- ☆81Updated 3 years ago
- [WIP] Unofficial Implementation of Microsoft's PromptTTS2☆52Updated 2 years ago
- Finetuning VITS Efficiently☆33Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆99Updated last year
- TechSinger: Technique Controllable Multilingual Singing Voice Synthesis via Flow Matching☆86Updated last month
- Implementation of Emo-StarGAN☆45Updated last year
- An unofficial PyTorch implementation of VALL-E☆88Updated 4 months ago
- GPT-style network for phonemization with durations of text☆68Updated last year
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Updated last year
- ☆28Updated 2 years ago
- Autovocoder: Fast Waveform Generation from a Learned Speech Representation using Differentiable Digital Signal Processing☆71Updated 3 years ago
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆55Updated 2 years ago
- VITS-based zero-shot TTS system varying with diverse style/speaker conditioning methods.☆36Updated 3 years ago
- MFA acoustic model training based on Opencpop☆15Updated 3 years ago
- A Chinese version of A Neural Parametric Singing Synthesizer☆13Updated 3 years ago
- ☆124Updated this week