emirdemirel / ALTA
A complete training recipe for Kaldi-based Automatic Lyrics Transcription.
☆29 Updated 2 years ago
Related projects
Alternatives and complementary repositories for ALTA
- Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation ☆78 Updated last year
- A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi… ☆14 Updated 2 years ago
- The official implementation of "TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music" ☆41 Updated 2 years ago
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023 ☆27 Updated last year
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base… ☆77 Updated 2 years ago
- DSing ASR task: resources and a baseline for unaccompanied singing ASR. ☆20 Updated 3 years ago
- PyTorch implementation of JDCNet, a singing voice detection and classification network ☆49 Updated last year
- Reproducible Subjective Evaluation ☆57 Updated 8 months ago
- ☆35 Updated 2 years ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset" ☆54 Updated 2 years ago
- PodcastMix: a dataset for separating music and speech in podcasts. ☆43 Updated 3 months ago
- acoss: Audio Cover Song Suite is a framework for feature extraction and benchmarking for the cover song identification (CSI) task ☆37 Updated last year
- ☆74 Updated last year
- Revisiting Singing Voice Detection: A Quantitative Review and the Future Outlook ☆66 Updated 2 years ago
- ☆23 Updated 5 years ago
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks" ☆54 Updated this week
- Official implementation of "Equivariant Self-Supervision for Musical Tempo Estimation (ISMIR 2022)" ☆25 Updated last year
- [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription ☆45 Updated 6 months ago
- Repository for ISMIR 2022 tutorial T3(M): Designing Controllable Synthesis System for Musical Signals ☆28 Updated last year
- ☆56 Updated last year
- Singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/ ☆16 Updated 4 years ago
- Readability-aware automatic lyrics transcription (ALT) evaluation toolkit ☆34 Updated 2 months ago
- ☆15 Updated 2 years ago
- Rough implementation of Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments (Ethan … ☆23 Updated 3 years ago
- [ISMIR 2019] Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice ☆27 Updated last year
- Yin pitch estimator in PyTorch ☆115 Updated 2 years ago
- Unofficial implementation of SpecTNT in PyTorch ☆42 Updated 2 years ago
- Code for paper: "Deep Embeddings and Section Fusion Improve Music Segmentation" ☆51 Updated 2 years ago
- Full models and training code for PESTO ☆52 Updated 5 months ago
- A project to synthesize massive amounts of multitrack audio data from MIDI. ☆56 Updated 4 years ago