A complete training recipe for kaldi-based Automatic Lyrics Transcription.
☆31Nov 30, 2021Updated 4 years ago
Alternatives and similar repositories for ALTA
Users that are interested in ALTA are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.☆12Nov 30, 2021Updated 4 years ago
- DSing ASR task: Resources and Baseline for an unaccompanied singing ASR.☆19Nov 23, 2021Updated 4 years ago
- ☆15Sep 26, 2022Updated 3 years ago
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…☆15Oct 13, 2022Updated 3 years ago
- ☆22Sep 26, 2022Updated 3 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- Pre-trained model and script to automatically align lyrics to polyphonic audio☆115Jun 16, 2020Updated 5 years ago
- DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation☆89Apr 30, 2025Updated 10 months ago
- ☆65Jun 26, 2025Updated 9 months ago
- DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.☆380Jun 11, 2020Updated 5 years ago
- Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignme…☆59Mar 9, 2020Updated 6 years ago
- Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…☆18Oct 27, 2020Updated 5 years ago
- ☆18Jan 20, 2025Updated last year
- A web app for annotating Freesound loops, and the tools to analyse the dataset created.☆20Jul 6, 2023Updated 2 years ago
- Implementation of paper "End-to-end lyrics alignment for polyphonic music using an audio-to-character recognition model"☆18Nov 20, 2022Updated 3 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- The Harmonix Set: Beats, Downbeats, and Structural Annotations for Pop Music☆229Dec 7, 2024Updated last year
- Project for MIDI to Audio Synthesis☆27Mar 13, 2023Updated 3 years ago
- [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription☆49May 7, 2024Updated last year
- Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structur…☆93Feb 13, 2018Updated 8 years ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset"☆63Mar 5, 2026Updated 3 weeks ago
- ☆226Dec 29, 2022Updated 3 years ago
- A dataset of pitch curves for music performance assessment☆10Jun 5, 2023Updated 2 years ago
- Readability-aware automatic lyrics transcription (ALT) evaluation toolkit☆43Aug 29, 2024Updated last year
- ☆14Nov 26, 2024Updated last year
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- Implementation of different noise embeddings for noise aware training of Kaldi acoustic models.☆13Feb 13, 2021Updated 5 years ago
- Util code, issues, discussions☆29Aug 31, 2018Updated 7 years ago
- Music structure segmentation with convnets☆13Mar 11, 2016Updated 10 years ago
- Wave-U-Net for automatic (drum) mixing☆38Mar 24, 2023Updated 3 years ago
- Perform transfer learning for MIR using Jukebox!☆187Oct 12, 2023Updated 2 years ago
- Algorithm and Data for paper "Automatic Detection of Hierarchical Structure and Influence of Structure on Melody, Harmony and Rhythm in P…☆100Oct 5, 2022Updated 3 years ago
- An opensource music processing toolkit☆319Jun 25, 2023Updated 2 years ago
- DALI datasets split used to train models presented in the paper Multilingual lyrics-to-audio alignment (ISMIR 2020).☆13May 25, 2021Updated 4 years ago
- In this repository, I try to combine k2 with speechbrain to decode well and fastly.☆16Jun 17, 2022Updated 3 years ago
- DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- This repo contains the baseline model recipes and pre-trained model for GramVanni hindi ASR challenge☆15Mar 26, 2022Updated 4 years ago
- Simple collection of MIR datasets with metadata and links☆256Updated this week
- Contains code for our work on speech to singing conversion (ICASSP 2020)☆50Oct 27, 2020Updated 5 years ago
- MIR conference deadline countdowns☆19Jun 24, 2022Updated 3 years ago
- Phone-level evaluation of L2 speakers (GOP algorithm)☆27Mar 1, 2017Updated 9 years ago
- Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation☆24Nov 8, 2021Updated 4 years ago
- PyTorch Implementation of Multi-Singer (ACM-MM'21)☆139May 8, 2022Updated 3 years ago