jhuang448 / MultilingualALT
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆11Updated 7 months ago
Alternatives and similar repositories for MultilingualALT:
Users that are interested in MultilingualALT are comparing it to the libraries listed below
- ☆12Updated last month
- ☆16Updated 4 months ago
- Project for MIDI to Audio Synthesis☆22Updated last year
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆14Updated 6 months ago
- A piano music dataset with Audio, Symbolic and Text labels☆25Updated 2 months ago
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆31Updated last month
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics☆13Updated 3 months ago
- FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation☆16Updated last month
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Updated last year
- Audio production style transfer with inference-time optimization☆33Updated 2 months ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- Landing Page for All Things Source Separation☆19Updated 2 months ago
- Code implementation for the paper titled MusicLIME: Explainable Multimodal Music Understanding☆14Updated this week
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆23Updated 9 months ago
- Repo for the IDESSAI 2024 course on modeling audio with discrete tokens.☆12Updated 4 months ago
- music semantic understanding evaluation benchmark☆25Updated last year
- ISMIR 24 Supplementary Material☆13Updated 3 months ago
- Code for the paper "Toward Fully Self-Supervised Multi-Pitch Estimation".☆18Updated 3 weeks ago
- Pytorch implementation of the invertible CQT based on Non-stationary Gabor filters☆29Updated last year
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆21Updated last year
- ☆15Updated 6 months ago
- Event Relation in Text-to-Audio (TTA) Generation☆17Updated 2 weeks ago
- The official implementation of DMEL the method presented in the paper "DMEL: The differentiable log-Mel spectrogram as a trainable layer …☆17Updated last month
- Repository for "Natural Language Processing Methods for Symbolic Music Generation and Information Retrieval Systems: a Survey"☆12Updated last week
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…☆36Updated 4 months ago
- Reimplementation of Bandit for "Remastering Divide and Remaster: A Cinematic Audio Source Separation Dataset with Multilingual Support"☆24Updated 6 months ago
- ☆43Updated last year
- ☆12Updated last year