jhuang448 / MultilingualALT
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆11Updated 8 months ago
Alternatives and similar repositories for MultilingualALT:
Users that are interested in MultilingualALT are comparing it to the libraries listed below
- ☆16Updated 5 months ago
- ☆13Updated 2 months ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Updated last year
- A piano music dataset with Audio, Symbolic and Text labels☆25Updated 3 months ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆12Updated 5 months ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆33Updated 3 months ago
- Project for MIDI to Audio Synthesis☆21Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆25Updated 10 months ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆21Updated 6 months ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆15Updated 7 months ago
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆13Updated 4 months ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆16Updated last month
- Repository for "Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval Systems: a Survey"☆12Updated last month
- music semantic understanding evaluation benchmark☆25Updated last year
- Code for the paper "MULTI-BAND MASKING FOR WAVEFORM-BASED SINGING VOICE SEPARATION" that was accepted on EUSIPCO2022☆15Updated 2 years ago
- ☆15Updated 7 months ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆17Updated 2 months ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆21Updated last year
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆40Updated last year
- Official Repository for The Paper, Let Network Decide What to Learn: Symbolic Music Understanding Model Based on Large-scale Adversarial …☆14Updated last month
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆19Updated 3 weeks ago
- Implementation of "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs" in PyTorch☆18Updated 3 weeks ago
- [ICASSP 2025] FreeSVC: Towards Zero-shot Multilingual Singing Voice Conversion☆50Updated 3 weeks ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆18Updated 2 months ago