navi0105/LyricAlignment

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/navi0105/LyricAlignment)

navi0105 / LyricAlignment

Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"

☆19

Alternatives and similar repositories for LyricAlignment

Users that are interested in LyricAlignment are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

mjhydri / Singing-Vocal-Beat-Tracking
View on GitHub
This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…
☆35Sep 4, 2022Updated 3 years ago
zhuole1025 / LyricWhiz
View on GitHub
[ISMIR 2023] LyricWhiz: Robust Multilingual Zero-shot Lyrics Transcription by Whispering to ChatGPT
☆56Nov 20, 2023Updated 2 years ago
samsad35 / code-ancogen
View on GitHub
[ICASSP 2025] AnCoGen: Analysis, Control and Generation of Speech with a Masked Autoencoder
☆14Mar 11, 2025Updated last year
georgid / lakh_vocal_segments_dataset
View on GitHub
singing voice with annotations of vocal onsets, based on the matched MIDI from http://colinraffel.com/projects/lmd/
☆20Dec 30, 2019Updated 6 years ago
emirdemirel / DALI-TestSet4ALT
View on GitHub
This is a subset of the DALI set consisting of 240 polyphonic recordings that is used to benchmark lyrics transcription evaluation.
☆12Nov 30, 2021Updated 4 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
aaronserianni / attention-iou
View on GitHub
[CVPR'25] Attention IoU: Examining Biases in CelebA using Attention Maps
☆13Mar 26, 2025Updated last year
guxm2021 / SVT_SpeechBrain
View on GitHub
[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
☆28Aug 30, 2024Updated last year
Sonata165 / ControllableLyricTranslation
View on GitHub
Code for the paper "Songs Across Borders: Singable and Controllable Neural Lyric Translation"
☆26Feb 3, 2026Updated 5 months ago
uthree / ddsp-vocoder
View on GitHub
☆12Nov 7, 2024Updated last year
yufenhuang / MOSA-Music-mOtion-and-Semantic-Annotation-dataset
View on GitHub
☆23Jun 13, 2024Updated 2 years ago
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
seungheondoh / hi_kia
View on GitHub
wake-up word emotion recognition [APSIPA 2022]
☆17Nov 11, 2022Updated 3 years ago
nicolaus625 / CMI-bench
View on GitHub
☆18Jun 24, 2025Updated last year
BakerBunker / SALT
View on GitHub
[ASRU 2023] Code of paper SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation
☆23Aug 13, 2024Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
napulen / AugmentedNet
View on GitHub
A Roman Numeral Analysis Network with Synthetic Training Examples and Additional Tonal Tasks
☆50Feb 11, 2024Updated 2 years ago
Princeton-CDH / geniza
View on GitHub
version 4.x of the Princeton Geniza Project
☆13Jul 9, 2026Updated 2 weeks ago
RayYuki / CodecBench
View on GitHub
☆24Nov 16, 2025Updated 8 months ago
Mddct / simple-tts
View on GitHub
（WIP）long form speech generatoins
☆30Apr 2, 2025Updated last year
migperfer / TriAD-ISMIR2023
View on GitHub
Code accompayning ISMIR23 paper; TriAD: Capturing harmonics with 3D convolutions
☆20Jul 19, 2024Updated 2 years ago
xxbidiao / GenerationMania
View on GitHub
GenerationMania: Generate IIDX-style rhythm action game stages
☆12Aug 3, 2019Updated 6 years ago
Sungwon-Han / DualFair
View on GitHub
☆14Jan 28, 2023Updated 3 years ago
yamathcy / ISMIR2022J-POP
View on GitHub
Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…
☆23Apr 23, 2024Updated 2 years ago
york135 / MIRMLPop
View on GitHub
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35Apr 22, 2024Updated 2 years ago
Proton VPN Special Offer - Get 70% off • Ad
Special partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
JeremieHornus / bezier-interpolation
View on GitHub
☆11Feb 20, 2025Updated last year
bryan051003 / USVG
View on GitHub
A unified model for zero-shot singing voice conversion and synthesis
☆22Nov 30, 2022Updated 3 years ago
SarthakYadav / audiomae-plusplus-official
View on GitHub
Official repository for the paper "AudioMAE++: learning better masked audio representations with SwiGLU FFNs"
☆15Apr 30, 2026Updated 2 months ago
Hannieliao / Baton
View on GitHub
Official Repository of IJCAI 2024 Paper: "BATON: Aligning Text-to-Audio Model with Human Preference Feedback"
☆32Mar 4, 2025Updated last year
carlosholivan / symbolic-music-structure-analysis
View on GitHub
☆19Mar 27, 2023Updated 3 years ago
RickyL-2000 / ROSVOT
View on GitHub
Robust Singing Voice Transcription and MIDI Extraction
☆123Nov 20, 2024Updated last year
LiChaiUSTC / CSL-L2M
View on GitHub
☆18May 4, 2025Updated last year
YatingMusic / ddsp-singing-vocoders
View on GitHub
Official implementation of SawSing (ISMIR'22)
☆275Aug 28, 2022Updated 3 years ago
zzw922cn / wesinger2
View on GitHub
Synthesized singing voice demos of WeSinger 2 paper.
☆26Feb 20, 2023Updated 3 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
mmorise / itako_singing
View on GitHub
東北イタコ歌唱データベースの最新ラベルデータ
☆24Jul 1, 2021Updated 5 years ago
ldzhangyx / music-melody-segmentation-using-neural-CRF
View on GitHub
☆13Nov 2, 2020Updated 5 years ago
schufo / plla-tisvs
View on GitHub
Phoneme Level Lyrics Alignment and Text-Informed Singing Voice Separation
☆24Nov 8, 2021Updated 4 years ago
qiuqiao / SOFA
View on GitHub
SOFA: Singing-Oriented Forced Aligner
☆226May 16, 2025Updated last year
sony / bigvsan
View on GitHub
Pytorch implementation of BigVSAN
☆203Dec 9, 2025Updated 7 months ago
fluxions-ai / stftvae
View on GitHub
Inference for the STFT-VAE continuous audio codec (24kHz, 3.125Hz latent)
☆43Jul 12, 2026Updated last week
jannik-brinkmann / social-biases-in-vision-transformers
View on GitHub
[ICCV 2023] Official PyTorch implementation of "A Multidimensional Analysis of Social Biases in Vision Transformers"
☆13Aug 11, 2023Updated 2 years ago