mjhydri / Singing-Vocal-Beat-Tracking
This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses an HMM decoder to infer singing beats and t…
☆30 Updated 2 years ago
Alternatives and similar repositories for Singing-Vocal-Beat-Tracking:
Users who are interested in Singing-Vocal-Beat-Tracking are comparing it to the libraries listed below.
- Frechet Audio Distance evaluation in PyTorch ☆35 Updated last year
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating … ☆64 Updated 3 weeks ago
- Project for MIDI to Audio Synthesis ☆23 Updated 2 years ago
- TAPE: An End-to-End Timbre-Aware Pitch Estimator ☆22 Updated last year
- A piano music dataset with Audio, Symbolic and Text labels ☆27 Updated last month
- Chorale Music Separation Dataset and Model Framework ☆35 Updated 2 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023. ☆41 Updated last year
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024] ☆41 Updated 2 months ago
- ☆23 Updated 11 months ago
- Deep Performer: Score-to-audio music performance synthesis ☆43 Updated last year
- Polyphonic generalisation of DDSP ☆18 Updated 11 months ago
- An implementation of "Towards Improving Harmonic Sensitivity and Prediction Stability for Singing Melody Extraction", in ISMIR 2023 ☆22 Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics … ☆26 Updated 11 months ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings ☆48 Updated last year
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation… ☆64 Updated last year
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks" ☆55 Updated 4 months ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models ☆62 Updated 2 years ago
- ☆22 Updated 2 years ago
- The official implementation of TokenSynth (ICASSP 2025) ☆59 Updated last month
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023 ☆30 Updated last year
- [ismir2019] Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice ☆28 Updated 2 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT). ☆59 Updated 2 years ago
- ☆18 Updated 3 years ago
- PodcastMix: A dataset for separating music and speech in podcasts. ☆43 Updated 7 months ago
- [PyTorch] Minimal codebase for MusicGen models ☆58 Updated 3 months ago
- Rough implementation of Simultaneous Separation and Transcription of Mixtures with Multiple Polyphonic and Percussive Instruments (Ethan … ☆24 Updated 4 years ago
- ☆55 Updated 5 months ago
- Code for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste… ☆36 Updated 7 months ago
- A PyTorch implementation: "LASAFT-Net-v2: Listen, Attend and Separate by Attentively aggregating Frequency Transformation" ☆33 Updated 3 years ago
- Repository for ISMIR 2022 tutorial T3(M): Designing Controllable Synthesis System for Musical Signals ☆28 Updated 2 years ago