mjhydri / Singing-Vocal-Beat-Tracking
This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages the WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses an HMM decoder to infer singing beats and t…
☆33Sep 4, 2022Updated 3 years ago
Alternatives and similar repositories for Singing-Vocal-Beat-Tracking
Users that are interested in Singing-Vocal-Beat-Tracking are comparing it to the libraries listed below
- Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"☆18Dec 14, 2023Updated 2 years ago
- This repository contains the implementation of an efficient joint beat, downbeat, tempo, and meter tracking system using a compact 1D pro…☆73Nov 28, 2023Updated 2 years ago
- [TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing☆26Aug 30, 2024Updated last year
- A Python package for IDyOM☆13Mar 31, 2023Updated 2 years ago
- ☆17Jun 24, 2025Updated 7 months ago
- A Python Library for Fundamental Frequency Estimation in Music Recordings☆53Jan 16, 2026Updated 3 weeks ago
- Synthesized singing voice demos of WeSinger 2 paper.☆26Feb 20, 2023Updated 2 years ago
- This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Orab…☆11Jul 6, 2023Updated 2 years ago
- ☆11Dec 17, 2025Updated last month
- Crawled from FreeMidi.org, MIDI files library including over twenty thousand files!☆32Jun 6, 2020Updated 5 years ago
- code and demo of the ISMIR 2021 paper CollageNet☆12Jul 12, 2021Updated 4 years ago
- Sing any popular song with your voice☆11Jul 10, 2022Updated 3 years ago
- Video Background Music Generation Using Unpaired Audio-Visual Data☆30Oct 8, 2024Updated last year
- Extension of the music21 library for working with music chords encoded according to the Harte Notation.☆13Apr 30, 2024Updated last year
- A toolset for easy formant extraction and visualization from wav files and TTS models☆33Sep 2, 2022Updated 3 years ago
- BeatNet is state-of-the-art (Real-Time) and Offline joint music beat, downbeat, tempo, and meter tracking system using CRNN and particle …☆451Feb 12, 2025Updated last year
- Official implementation of SawSing (ISMIR'22)☆272Aug 28, 2022Updated 3 years ago
- BigVGAN with Neural Source-Filter☆56Sep 21, 2023Updated 2 years ago
- Repository for the paper: VoiceMe: Personalized voice generation in TTS☆126Apr 29, 2022Updated 3 years ago
- Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)☆124Jun 16, 2022Updated 3 years ago
- ☆123Jan 9, 2020Updated 6 years ago
- ☆12Jul 6, 2023Updated 2 years ago
- A manually annotated dataset of cue points☆13Nov 5, 2019Updated 6 years ago
- A singing voice database created by having 22 people each sing five children's songs.☆14Aug 7, 2022Updated 3 years ago
- Source code of APNet2, a vocoder☆58Nov 23, 2023Updated 2 years ago
- Self-supervised learning for real-time pitch estimation☆275Oct 15, 2025Updated 3 months ago
- Generating Chords from Melody with Flexible Harmonic Rhythm and Controllable Harmonic Density [EURASIP JASMP]☆62Jan 15, 2023Updated 3 years ago
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Support material and source code for the model described in : "A Recurrent Encoder-Decoder Approach With Skip-Filtering Connections For M…☆13Sep 19, 2017Updated 8 years ago
- ☆94Oct 16, 2025Updated 3 months ago
- ICANN 2021: Multi-Modal Chorus Recognition for Improving Song Search☆28Aug 30, 2021Updated 4 years ago
- Scales, Chords, and Cadences: Practical Music Theory for MIR Researchers☆62Nov 8, 2021Updated 4 years ago
- Implementation of CREPE Pitch tracker with PyTorch☆19Jan 28, 2020Updated 6 years ago
- A simple tool to easily use Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD.☆45May 25, 2023Updated 2 years ago
- @dharasim The iRealPro jazz chord sequences including tree analysis☆40Jun 28, 2021Updated 4 years ago
- A demo for the ResNet-18 hierarchical classification note segment system☆16Apr 15, 2019Updated 6 years ago
- Machine and Deep Learning models for speech dereverberation☆121Feb 21, 2022Updated 3 years ago
- A diffusion-based cross-lingual voice conversion model, as my bachelor's thesis☆44Jul 24, 2023Updated 2 years ago
- NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022☆307Sep 16, 2023Updated 2 years ago