amazon-science / unsupervised-melody-to-lyrics-generation
This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Oraby, Alessandra Cervone, Gunnar Sigurdsson, Chenyang Tao, Wenbo Zhao, Tagyoung Chung, Jing Huang, and Nanyun Peng.
☆11Updated last year
Alternatives and similar repositories for unsupervised-melody-to-lyrics-generation:
Users that are interested in unsupervised-melody-to-lyrics-generation are comparing it to the libraries listed below
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆17Updated last month
- music semantic understanding evaluation benchmark☆25Updated last year
- A spoken version of the textual story cloze benchmark☆14Updated last year
- A piano music dataset with Audio, Symbolic and Text labels☆25Updated 2 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆23Updated 9 months ago
- Deep Performer: Score-to-audio music performance synthesis☆42Updated last year
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- The code repository for our paper "Interpreting Song Lyrics with a Music-Informed Pre-trained Language Model".☆22Updated 2 years ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆31Updated last month
- Audio Generation model working with GPT-2 and VQVAE compressed representation of MelSpectrograms☆18Updated last year
- Project for MIDI to Audio Synthesis☆22Updated last year
- Pushing the Limits of Zero-shot End-to-End Speech Translation☆25Updated last month
- ☆16Updated 4 months ago
- 60k hours of phoneme-aligned audio from audio books☆18Updated 6 months ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆14Updated 6 months ago
- Perceived Music Quality Dataset☆11Updated 7 months ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023☆19Updated last year
- ☆34Updated 9 months ago
- Torch implementation of Whisper-guided DDPM based Voice Conversion☆49Updated last year
- ☆25Updated 6 months ago
- NISQA - Non-Intrusive Speech Quality and TTS Naturalness Assessment☆16Updated 2 years ago
- My hybrid TTS network that combines, VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆27Updated 5 months ago
- ☆19Updated last year
- Aligner for text-to-speech☆15Updated 6 months ago
- ☆41Updated last year
- A unified model for zero-shot singing voice conversion and synthesis