South-Twilight / SingMOSLinks
☆26Updated 4 months ago
Alternatives and similar repositories for SingMOS
Users that are interested in SingMOS are comparing it to the libraries listed below
Sorting:
- Robust Singing Voice Transcription and MIDI Extraction☆93Updated 11 months ago
- VAE modified from Descript Audio Codec, which replaces the RVQ with VAE☆81Updated last year
- Music generation☆24Updated last year
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- ☆108Updated 2 months ago
- STARS: A Unified Framework for Singing Transcription, Alignment, and Refined Style Annotation☆57Updated 3 months ago
- ☆47Updated last year
- ☆44Updated last year
- Vocoder NSF-HiFiGAN (Moved into deepaudio)☆55Updated 2 years ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆88Updated 4 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆44Updated 5 months ago
- A Singing Style Conversion Framework Based On Audio Infilling☆30Updated 6 months ago
- MeanAudio: Fast and Faithful Text-to-Audio Generation with Mean Flows☆105Updated 2 months ago
- Source code of APNet2, a vocoder☆55Updated last year
- ☆59Updated last year
- Training, validation, and inference code for various SSL approaches and architectures.☆67Updated last week
- ☆46Updated 2 weeks ago
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment☆68Updated last year
- ☆38Updated this week
- Comprehensive benchmark suite comparing pitch detection algorithms across multiple datasets.☆37Updated 2 months ago
- Fast and accurate fundamental frequency (F0) detector using convolutional neural networks☆86Updated 2 months ago
- LAFMA: A Latent Flow Matching Model for Text-to-Audio Generation (INTERSPEECH 2024)☆41Updated last year
- Audio-to-Audio Schrodinger Bridges is a diffusion-based audio restoration model for bandwidth extension and inpainting.☆109Updated 2 months ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆83Updated 3 months ago
- ☆106Updated last year
- AQUA-Tk = Audio QUality Assessment-Toolkit. (In development)☆100Updated 11 months ago
- PAM is a no-reference audio quality metric for audio generation tasks☆74Updated last year
- A DDSP-based neural voice synthesiser.☆119Updated 11 months ago
- Pitch Controllable DDSP Vocoders☆77Updated 11 months ago
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆45Updated 9 months ago