emirdemirel/DALI-TestSet4ALT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/emirdemirel/DALI-TestSet4ALT)

emirdemirel / DALI-TestSet4ALT

This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.

☆12

Alternatives and similar repositories for DALI-TestSet4ALT

Users that are interested in DALI-TestSet4ALT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

groadabike / Kaldi-Dsing-task
View on GitHub
DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.
☆19Jul 9, 2026Updated 2 weeks ago
emirdemirel / ALTA
View on GitHub
A complete training recipe for kaldi-based Automatic Lyrics Transcription.
☆32Nov 30, 2021Updated 4 years ago
audioshake / alt-eval
View on GitHub
Readability-aware automatic lyrics transcription (ALT) evaluation toolkit
☆44Aug 29, 2024Updated last year
f90 / jamendolyrics
View on GitHub
DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation
☆88Apr 30, 2025Updated last year
KinWaiCheuk / Jointist
View on GitHub
Official Implementation of Jointist
☆37Jul 26, 2023Updated 2 years ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
carlosholivan / symbolic-music-structure-analysis
View on GitHub
☆19Mar 27, 2023Updated 3 years ago
sony / timbre-trap
View on GitHub
Code for the paper "Timbre-Trap: A Low-Resource Framework for Instrument-Agnostic Music Transcription"
☆43May 5, 2024Updated 2 years ago
schufo / plla-tisvs
View on GitHub
Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
☆24Nov 8, 2021Updated 4 years ago
rainerkelz / ICASSP19
View on GitHub
☆19Dec 13, 2019Updated 6 years ago
Sonata165 / ControllableLyricTranslation
View on GitHub
Code for the paper "Songs Across Borders: Singable and Controllable Neural Lyric Translation"
☆26Feb 3, 2026Updated 5 months ago
xiaoxue1117 / speech-mamba-public
View on GitHub
☆15Nov 26, 2024Updated last year
chitralekha18 / AutoLyrixAlign
View on GitHub
Pre-trained model and script to automatically align lyrics to polyphonic audio
☆116Jun 16, 2020Updated 6 years ago
napulen / AugmentedNet
View on GitHub
A Roman Numeral Analysis Network with Synthetic Training Examples and Additional Tonal Tasks
☆50Feb 11, 2024Updated 2 years ago
xjchenGit / SingGraph
View on GitHub
Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).
☆24Sep 19, 2025Updated 10 months ago
End-to-end encrypted cloud storage - Proton Drive • Ad
Special offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
bryan051003 / USVG
View on GitHub
A unified model for zero-shot singing voice conversion and synthesis
☆22Nov 30, 2022Updated 3 years ago
fundwotsai2001 / MIDI-SAG
View on GitHub
Official code for "MIDI-Informed Singing Accompaniment Generation in a Compositional Song Pipeline"
☆24May 30, 2026Updated last month
hiromu / contrastive-singing-voices
View on GitHub
Implementation of "Self-Supervised Contrastive Learning for Singing Voices"
☆20May 8, 2022Updated 4 years ago
mmorise / itako_singing
View on GitHub
東北イタコ歌唱データベースの最新ラベルデータ
☆24Jul 1, 2021Updated 5 years ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
chrisdonahue / music-cocreation-tutorial
View on GitHub
Start-to-finish tutorial for interactive music co-creation in PyTorch and Tensorflow.js
☆110Nov 6, 2021Updated 4 years ago
york135 / MIRMLPop
View on GitHub
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35Apr 22, 2024Updated 2 years ago
ws-choi / LASAFT-Net-v2
View on GitHub
A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation"
☆33Apr 11, 2022Updated 4 years ago
desh2608 / kaldi-noise-vectors
View on GitHub
Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.
☆13Feb 13, 2021Updated 5 years ago
GPUs on demand by Runpod - Special Offer Available • Ad
Run AI, ML, and HPC workloads on powerful cloud GPUs—without limits or wasted spend. Deploy GPUs in under a minute and pay by the second.
kwatcharasupat / musdb25
View on GitHub
MUSDB25 - A Fully Multitrack Dataset for Music Source Separation
☆13Mar 29, 2025Updated last year
salu133445 / deepperformer
View on GitHub
Deep Performer: Score-to-audio music performance synthesis
☆47Jun 26, 2023Updated 3 years ago
pc2752 / ss_synthesis
View on GitHub
☆17Jul 31, 2019Updated 6 years ago
zhuole1025 / LyricWhiz
View on GitHub
[ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
☆56Nov 20, 2023Updated 2 years ago
ws-choi / AMSS-Net
View on GitHub
A PyTorch implementation of the paper: "AMSS-Net: Audio Manipulation on User-Specified Sources with Textual Queries" (ACM Multimedia 2021…
☆21Jul 4, 2021Updated 5 years ago
helenacuesta / MelodyExtraction
View on GitHub
Melody Extraction project for the MIR course of the master in Sound and Music Computing at UPF (Barcelona).
☆39Mar 17, 2017Updated 9 years ago
SonyResearch / ITO-Master
View on GitHub
Implementation of the paper "ITO-Master: Inference-Time Optimization for Audio Effects Modeling of Music Mastering Processors"
☆27Jul 3, 2025Updated last year
jhuang448 / LyricsAlignment-MTL
View on GitHub
☆67Jun 26, 2025Updated last year
vincenzomadaghiele / MINGUS
View on GitHub
A transformer neural network that generates symbolic music improvising over chord changes.
☆19Jul 14, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
interactiveaudiolab / VocalImitationSet
View on GitHub
☆18Oct 16, 2018Updated 7 years ago
cyfer0618 / kaldi-pytorch-rnnlm
View on GitHub
Enable RNNLM lattice rescoring with Pytorch [kaldi]
☆12Jun 5, 2020Updated 6 years ago
kyungyunlee / mono2mixed-singer
View on GitHub
[ismir2019] Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice
☆28Dec 8, 2022Updated 3 years ago
ga642381 / Taiwanese-Whisper
View on GitHub
fine-tune Whipser model for Taiwanese speech recognition
☆37Mar 23, 2023Updated 3 years ago
SonyCSLParis / audio-metrics
View on GitHub
Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.
☆47Jan 15, 2026Updated 6 months ago
guxm2021 / ALT_SpeechBrain
View on GitHub
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
☆51May 7, 2024Updated 2 years ago
gpu-poor / gramvaani_hindi_asr
View on GitHub
This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge
☆16Mar 26, 2022Updated 4 years ago