zhuole1025 / LyricWhiz
[ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
☆53 · Nov 20, 2023 · Updated 2 years ago
Alternatives and similar repositories for LyricWhiz
Users that are interested in LyricWhiz are comparing it to the libraries listed below
- Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment" ☆18 · Dec 14, 2023 · Updated 2 years ago
- DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation ☆89 · Apr 30, 2025 · Updated 9 months ago
- Readability-aware automatic lyrics transcription (ALT) evaluation toolkit ☆43 · Aug 29, 2024 · Updated last year
- Musical Word Embedding for Music Tagging and Retrieval [IEEE TASLP] ☆27 · Apr 23, 2024 · Updated last year
- State-of-the-art pretrained music models for training, evaluation, inference ☆160 · Jan 20, 2026 · Updated 3 weeks ago
- singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/ ☆19 · Dec 30, 2019 · Updated 6 years ago
- TheGlueNote is a representation model for note-wise music alignment. ☆12 · Jul 19, 2024 · Updated last year
- 👄🇧🇷 Forced phonetic alignment in Brazilian Portuguese ☆12 · Jul 18, 2025 · Updated 6 months ago
- DALI datasets split used to train models presented in the paper Multilingual lyrics-to-audio alignment (ISMIR 2020). ☆13 · May 25, 2021 · Updated 4 years ago
- sliding HPSS and two stage HPSS (singing voice enhancement) ☆17 · Oct 9, 2020 · Updated 5 years ago
- A Representation Evaluation Framework for Music Information Retrieval tasks ☆53 · Apr 9, 2024 · Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆32 · Apr 22, 2024 · Updated last year
- Automatic lyrics alignment at phoneme or word level with a pre-trained deep neural network. ☆41 · Aug 21, 2023 · Updated 2 years ago
- Implementation of paper "End-to-end lyrics alignment for polyphonic music using an audio-to-character recognition model" ☆18 · Nov 20, 2022 · Updated 3 years ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025 ☆29 · Sep 11, 2025 · Updated 5 months ago
- ☆19 · Feb 2, 2023 · Updated 3 years ago
- ☆70 · Jun 12, 2025 · Updated 8 months ago
- A repo that builds text-to-music datasets from scratch, used in MuseControlLite [ICML2025] ☆27 · May 20, 2025 · Updated 8 months ago
- list of MIR dataset papers presented at ISMIR 2022 ☆61 · Dec 11, 2022 · Updated 3 years ago
- Moises Source Separation Public Dataset ☆172 · Feb 5, 2025 · Updated last year
- PiCoGen (Piano Cover Generation) is an academic project aimed at developing an automatic piano cover generation system. ☆46 · Dec 4, 2025 · Updated 2 months ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024). ☆25 · Sep 19, 2025 · Updated 4 months ago
- ☆32 · Apr 22, 2024 · Updated last year
- Findings of ACL 2023 | AlignSTS: a speech-to-singing (STS) model based on modality disentanglement and cross-modal alignment ☆68 · Jul 5, 2024 · Updated last year
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks ☆46 · May 24, 2025 · Updated 8 months ago
- Results and Models for Learning Audio Representations of Music Content ☆107 · Dec 3, 2024 · Updated last year
- ☆65 · Jun 26, 2025 · Updated 7 months ago
- The official GitHub page for the survey paper "Foundation Models for Music: A Survey". ☆220 · Sep 4, 2024 · Updated last year
- This is the accompanying repository to the paper - Automatic Estimation of Singing Voice Musical Dynamics ☆15 · Oct 28, 2024 · Updated last year
- ☆17 · Jun 24, 2025 · Updated 7 months ago
- MUSDB25 - A Fully Multitrack Dataset for Music Source Separation ☆13 · Mar 29, 2025 · Updated 10 months ago
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation ☆24 · Nov 8, 2021 · Updated 4 years ago
- PyTorch implementation of "Source Separation by Flow Matching (FLOSS)" by Google DeepMind ☆91 · Nov 24, 2025 · Updated 2 months ago
- The source code for the paper XiaoiceSing2 (interspeech2023) ☆49 · Jan 15, 2024 · Updated 2 years ago
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation. ☆12 · Nov 30, 2021 · Updated 4 years ago
- Perform the forced decoding with target transcription ☆11 · Sep 12, 2018 · Updated 7 years ago
- ☆13 · Dec 18, 2017 · Updated 8 years ago
- DysfluentWFST ☆17 · Nov 13, 2025 · Updated 3 months ago
- A Chinese singing voice dataset, professional male singer, 105 songs, 132 minutes ☆11 · Oct 19, 2023 · Updated 2 years ago