georgid/AlignmentDuration

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/georgid/AlignmentDuration)

georgid / AlignmentDuration

Lyrics-to-audio-alignement system. Based on Machine Learning Algorithms: Hidden Markov Models with Viterbi forced alignment. The alignment is explicitly aware of durations of musical notes. The phonetic model are classified with MLP Deep Neural Network.

☆59

Alternatives and similar repositories for AlignmentDuration

Users that are interested in AlignmentDuration are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

rupakvignesh / Lyrics-to-Audio-Alignment
View on GitHub
Aligns text (lyrics) with monophonic singing voice (audio). The algorithm uses structural segmentation to segment the audio into structur…
☆94Feb 13, 2018Updated 8 years ago
emirdemirel / ALTA
View on GitHub
A complete training recipe for kaldi-based Automatic Lyrics Transcription.
☆32Nov 30, 2021Updated 4 years ago
jhuang448 / E2E-LyricsAlignment-Implementation
View on GitHub
Implementation of paper "End-to-end lyrics alignment for polyphonic music using an audio-to-character recognition model"
☆18Nov 20, 2022Updated 3 years ago
deezer / MultilingualLyricsToAudioAlignment
View on GitHub
DALI datasets split used to train models presented in the paper Multilingual lyrics-to-audio alignment (ISMIR 2020).
☆13May 25, 2021Updated 5 years ago
KentoW / melody-lyrics
View on GitHub
All source URLs of the 1,000 songs for creating melody-lyric alignment data.
☆15Aug 15, 2019Updated 6 years ago
Open source password manager - Proton Pass • Ad
Securely store, share, and autofill your credentials with Proton Pass, the end-to-end encrypted password manager trusted by millions.
chitralekha18 / AutoLyrixAlign
View on GitHub
Pre-trained model and script to automatically align lyrics to polyphonic audio
☆116Jun 16, 2020Updated 6 years ago
ljuvela / multiscale-GAN
View on GitHub
Code for ICASSP 2019 paper
☆18Oct 29, 2018Updated 7 years ago
jhuang448 / LyricsAlignment-MTL
View on GitHub
☆67Jun 26, 2025Updated last year
SwagLyrics / autosynch
View on GitHub
Automated lyrics-to-audio alignment using syllabic nuclei detection. Developed during Google Summer of Code 2019.
☆53Jul 6, 2023Updated 3 years ago
ronggong / jingjuSingingPhraseMatching
View on GitHub
Code for the paper: Audio to Score Matching by Combining Phonetic and Duration Information
☆29Jul 9, 2017Updated 9 years ago
ronggong / MIREX-2018-Automatic-Lyrics-to-Audio-Alignment
View on GitHub
Util code, issues, discussions
☆29Aug 31, 2018Updated 7 years ago
georgid / Lyrics2AudioAligner
View on GitHub
lyrics-to-audio-alignement system. Initially done using HTK for rapid prototyping
☆14Mar 14, 2018Updated 8 years ago
rabitt / contour_classification
View on GitHub
code for research project on melody extraction by contour classification
☆17May 26, 2016Updated 10 years ago
f90 / jamendolyrics
View on GitHub
DEPRECATED: Jamendo music dataset with time-aligned lyrics for lyrics alignment evaluation
☆88Apr 30, 2025Updated last year
GPU virtual machines on DigitalOcean Gradient AI • Ad
Get to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
shamidreza / unitselection
View on GitHub
A python implementation of a simple Unit Selection Text-to-Speech (TTS) synthesis system. It works with CMU-Arctic data by default
☆11Mar 14, 2015Updated 11 years ago
georgid / AlignmentEvaluation
View on GitHub
Scripts for computing common lyrics-to-audio alignment evaluation metrics. Usable evaluation for any token-based alignment (e.g. if tok…
☆18Oct 27, 2020Updated 5 years ago
i3thuan5 / hts_engine_python
View on GitHub
python wrap for hts engine
☆14Jan 30, 2018Updated 8 years ago
gabolsgabs / DALI
View on GitHub
DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.
☆380Jun 11, 2020Updated 6 years ago
zaocan666 / CollageNet
View on GitHub
code and demo of the ISMIR 2021 paper CollageNet
☆12Jul 12, 2021Updated 5 years ago
emirdemirel / ASA_ICASSP2021
View on GitHub
A duration-invariant audio-to-lyrics alignment pipeline with low memory footprint which segments long music recordings via a recursive bi…
☆15Oct 13, 2022Updated 3 years ago
tachi-hi / euterpe
View on GitHub
Real-time Audio-to-audio Karaoke Generation System for Monaural Music
☆42Mar 5, 2026Updated 4 months ago
Khalian / Modulo7
View on GitHub
A semantic and technical analysis of musical scores based on Information Retrieval Principles
☆15Oct 13, 2022Updated 3 years ago
MZehren / Automix
View on GitHub
Automatic DJ-mixing of tracks
☆40Feb 11, 2020Updated 6 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jayneelparekh / sp2si-code
View on GitHub
Contains code for our work on speech to singing conversion (ICASSP 2020)
☆50Oct 27, 2020Updated 5 years ago
juheo / Adversarially-Trained-End-to-end-Korean-Singing-Voice-Synthesis-System
View on GitHub
Adversarially Trained End-to-end Korean SInging Voice Synthesis System
☆54Nov 26, 2019Updated 6 years ago
bill317996 / Melody-extraction-with-melodic-segnet
View on GitHub
The source code of "A Streamlined Encoder/Decoder Architecture for Melody Extraction"
☆74Feb 10, 2020Updated 6 years ago
cyhuang-tw / robust-vc
View on GitHub
☆11May 7, 2022Updated 4 years ago
ffont / ismir2016
View on GitHub
Instructions for reproducing the research described in the paper "Tempo Estimation for Music Loops and a Simple Confidence Measure"
☆14Nov 18, 2016Updated 9 years ago
p0p4k / Matcha-TTS-2
View on GitHub
E2E TTS using Conditional Flow Matching (Experimental*)
☆71Nov 10, 2023Updated 2 years ago
SJTMusicTeam / SVS_system
View on GitHub
A system works on singing voice synthesis
☆79Jan 11, 2023Updated 3 years ago
CSTR-Edinburgh / magphase
View on GitHub
MagPhase Vocoder: Speech analysis/synthesis system for TTS and related applications.
☆80Oct 14, 2019Updated 6 years ago
lmaxwell / Armednn
View on GitHub
cross-platform modular neural network inference library, small and efficient
☆13May 15, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
evanotero / deep-music-genre-classification
View on GitHub
🎵 Using Deep Learning to Categorize Music as Time Progresses Through Spectrogram Analysis
☆22Dec 26, 2017Updated 8 years ago
revsic / torch-retriever-vc
View on GitHub
PyTorch implementation of Retriever: Learning Content-Style Representation
☆12Jan 27, 2023Updated 3 years ago
tachi-hi / slidingHPSS
View on GitHub
sliding HPSS and two stage HPSS (singing voice enhancement)
☆17Oct 9, 2020Updated 5 years ago
guxm2021 / ALT_SpeechBrain
View on GitHub
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
☆51May 7, 2024Updated 2 years ago
johnglover / metamorph
View on GitHub
Metamorph is an open source library for performing high-level sound transformations based on a sinusoids plus noise plus transients model…
☆19Jun 23, 2013Updated 13 years ago
andi611 / ZeroSpeech-TTS-without-T
View on GitHub
A Pytorch implementation for the ZeroSpeech 2019 challenge.
☆112Nov 12, 2019Updated 6 years ago
sp-nitech / SPTK
View on GitHub
A suite of speech signal processing tools
☆246Jul 14, 2026Updated last week