mjhydri/Singing-Vocal-Beat-Tracking


This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses an HMM decoder to infer singing beats and t…

☆35

Alternatives and similar repositories for Singing-Vocal-Beat-Tracking

Users who are interested in Singing-Vocal-Beat-Tracking are comparing it to the libraries listed below.


navi0105 / LyricAlignment
Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"
☆19 · Dec 14, 2023 · Updated 2 years ago
mjhydri / 1D-StateSpace
This repository contains the implementation of an efficient joint beat, downbeat, tempo, and meter tracking system using a compact 1D pro…
☆74 · Nov 28, 2023 · Updated 2 years ago
slychief / ismir2018_tutorial
☆126 · Jan 9, 2020 · Updated 6 years ago
georgid / lakh_vocal_segments_dataset
Singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/
☆20 · Dec 30, 2019 · Updated 6 years ago
guxm2021 / SVT_SpeechBrain
[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
☆28 · Aug 30, 2024 · Updated last year
xinyiguan / py2lispIDyOM
A Python package for IDyOM
☆14 · Mar 31, 2023 · Updated 3 years ago
migperfer / AutoMashupper
Tool to aid in the creation of mashups
☆21 · Apr 7, 2020 · Updated 6 years ago
amazon-science / unsupervised-melody-to-lyrics-generation
This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Orab…
☆11 · Jul 6, 2023 · Updated 3 years ago
google-deepmind / slowfast_nfnets
☆30 · Jun 22, 2022 · Updated 4 years ago
andreamust / harte-library
Extension of the music21 library for working with music chords encoded according to the Harte Notation.
☆14 · Apr 30, 2024 · Updated 2 years ago
zzw922cn / wesinger2
Synthesized singing voice demos of WeSinger 2 paper.
☆26 · Feb 20, 2023 · Updated 3 years ago
bryanwuAC / audio2vec
☆10 · Mar 12, 2019 · Updated 7 years ago
oatsu-gh / enunu_kodoku_singing
A singing voice database created by having 22 people each sing five children's songs.
☆15 · Aug 7, 2022 · Updated 3 years ago
groupmm / libf0
A Python Library for Fundamental Frequency Estimation in Music Recordings
☆55 · Jun 5, 2026 · Updated last month
babe269 / performant
A toolset for easy formant extraction and visualization from wav files and TTS models
☆33 · Sep 2, 2022 · Updated 3 years ago
zhuole1025 / LyricWhiz
[ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
☆56 · Nov 20, 2023 · Updated 2 years ago
PlayVoice / BigVGAN
BigVGAN with Neural Source-Filter
☆58 · Sep 21, 2023 · Updated 2 years ago
mjhydri / BeatNet
BeatNet is state-of-the-art (Real-Time) and Offline joint music beat, downbeat, tempo, and meter tracking system using CRNN and particle …
☆504 · Apr 13, 2026 · Updated 3 months ago
SonyCSLParis / pesto
Self-supervised learning for real-time pitch estimation
☆297 · Oct 15, 2025 · Updated 9 months ago
yl4579 / AuxiliaryASR
Joint CTC-S2S Phoneme-level ASR for Voice Conversion and TTS (Text-Mel Alignment)
☆127 · Jun 16, 2022 · Updated 4 years ago
josephding23 / Free-Midi-Library
Crawled from FreeMidi.org, MIDI files library including over twenty thousand files!
☆33 · Jun 6, 2020 · Updated 6 years ago
polvanrijn / VoiceMe
Repository for the paper: VoiceMe: Personalized voice generation in TTS
☆125 · Apr 29, 2022 · Updated 4 years ago
zxxwxyyy / sonique
Video Background Music Generation Using Unpaired Audio-Visual Data
☆33 · Oct 8, 2024 · Updated last year
redmist328 / APNet2
Source code of APNet2, a vocoder
☆60 · Nov 23, 2023 · Updated 2 years ago
YatingMusic / ddsp-singing-vocoders
Official implementation of SawSing (ISMIR'22)
☆275 · Aug 28, 2022 · Updated 3 years ago
Js-Mim / mlsp2017_svsep_skipfilt
Support material and source code for the model described in: "A Recurrent Encoder-Decoder Approach With Skip-Filtering Connections For M…
☆13 · Sep 19, 2017 · Updated 8 years ago
LEEYOONHYUNG / GraphTTS
☆12 · Jul 6, 2023 · Updated 3 years ago
zaocan666 / CollageNet
Code and demo of the ISMIR 2021 paper CollageNet
☆12 · Jul 12, 2021 · Updated 5 years ago
sander-wood / autoharmonizer
Generating Chords from Melody with Flexible Harmonic Rhythm and Controllable Harmonic Density [EURASIP JASMP]
☆64 · Jan 15, 2023 · Updated 3 years ago
muthissar / diffstm
☆10 · Dec 16, 2022 · Updated 3 years ago
archinetai / cqt-pytorch
An invertible and differentiable implementation of the Constant-Q Transform (CQT).
☆73 · Dec 9, 2022 · Updated 3 years ago
maum-ai / nuwave2
NU-Wave 2: A General Neural Audio Upsampling Model for Various Sampling Rates @ INTERSPEECH 2022
☆312 · Sep 16, 2023 · Updated 2 years ago
otnemrasordep / ismir2022-datasets
List of MIR dataset papers presented at ISMIR 2022
☆61 · Dec 11, 2022 · Updated 3 years ago
jcdevaney / ISMIR-musicTheoryTutorial
Scales, Chords, and Cadences: Practical Music Theory for MIR Researchers
☆62 · Nov 8, 2021 · Updated 4 years ago
Jackson-Kang / MFARunner
A simple tool to easily use Montreal Forced Aligner. Also provides alignments (TextGrid) retrieved from ESD.
☆45 · May 25, 2023 · Updated 3 years ago
diegocarrera89 / quantTree
☆11 · Jul 25, 2023 · Updated 3 years ago
sony / DiffRoll
PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model
☆81 · Dec 6, 2023 · Updated 2 years ago
bzamecnik / deep-instrument-heroku
ML model to classify music instruments from audio - Heroku deployment.
☆18 · Oct 16, 2016 · Updated 9 years ago
bryan051003 / USVG
A unified model for zero-shot singing voice conversion and synthesis
☆22 · Nov 30, 2022 · Updated 3 years ago