guxm2021/MM_ALT

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/guxm2021/MM_ALT)

guxm2021 / MM_ALT

[MM 2022] MM-ALT: A Multimodal Automatic Lyric Transcription System (Oral, Top paper award)

☆21

Alternatives and similar repositories for MM_ALT

Users that are interested in MM_ALT are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

guxm2021 / ALT_SpeechBrain
View on GitHub
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
☆51May 7, 2024Updated 2 years ago
xk-wang / MusicYOLO
View on GitHub
MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.
☆11Jan 29, 2022Updated 4 years ago
guxm2021 / SVT_SpeechBrain
View on GitHub
[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
☆28Aug 30, 2024Updated last year
YisongMiao / DiSQ-Score
View on GitHub
The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understandi…
☆18Aug 7, 2024Updated last year
wei-zeng98 / piano-a2s
View on GitHub
End-to-end real-world polyphonic piano audio-to-score transcription with hierarchical decoding (IJCAI 2024)
☆41Sep 17, 2024Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
ashispati / GuitarSoloDetection
View on GitHub
Code accompanying AES Semantic Audio Conference paper titled "A Dataset and Method for Guitar Solo Detection in Rock Music"
☆11Jan 18, 2018Updated 8 years ago
sony / DiffRoll
View on GitHub
PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model
☆81Dec 6, 2023Updated 2 years ago
sail-sg / Video-Next-Event-Prediction
View on GitHub
☆28Aug 9, 2025Updated 11 months ago
haonan3 / V1
View on GitHub
V1: Toward Multimodal Reasoning by Designing Auxiliary Task
☆36Apr 14, 2025Updated last year
Kvothe045 / Audio-Enhancer
View on GitHub
☆13Aug 3, 2025Updated 11 months ago
christianazinn / MIDI-RWKV
View on GitHub
☆24Jan 24, 2026Updated 6 months ago
jack1yang / image-paragraph-captioning
View on GitHub
A Hierarchical Approach for Generating Descriptive Image Paragraphs
☆10Mar 27, 2020Updated 6 years ago
SJTMusicTeam / MusicGeneration
View on GitHub
☆10May 15, 2021Updated 5 years ago
naba89 / iSeparate-SDX
View on GitHub
iSeparate library for the SDX2023 challenge
☆15Dec 15, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
nethermanpro / ComSL
View on GitHub
☆11Oct 14, 2023Updated 2 years ago
sanderwood / melodyt5
View on GitHub
MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]
☆50Jan 23, 2025Updated last year
davidliujiafeng / ccom_mdx2023
View on GitHub
☆10Jun 6, 2023Updated 3 years ago
yoongi43 / music_source_separation
View on GitHub
☆14Jan 12, 2023Updated 3 years ago
AnzorGozalishvili / sifrank_serving
View on GitHub
A better working example of SIFRank and SIFRank+ models for keyword extraction. Easy to setup using docker-compose.
☆11Oct 29, 2024Updated last year
felixCheungcheung / mixing_secrets_v2
View on GitHub
A NEW VERSION OF MIXING SECRETS DATASET FOR MUSIC SOURCE SEPARATION
☆22Mar 3, 2023Updated 3 years ago
zhaojw1998 / Query-and-reArrange
View on GitHub
Code and demo for paper: Zhao et al., "Q&A: Query-Based Representation Learning for Multi-Track Symbolic Music re-Arrangement," IJCAI 202…
☆21May 2, 2024Updated 2 years ago
ytyz1307zzh / TextGeneration_Transformer
View on GitHub
text generation from keywords using transformer model
☆12Nov 2, 2019Updated 6 years ago
AmphionTeam / AnyAccomp
View on GitHub
AnyAccomp: Generalizable accompaniment generation for vocals and solo instruments, powered by a quantized melodic bottleneck.
☆39Dec 22, 2025Updated 7 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
yl4467 / singer
View on GitHub
☆15Feb 22, 2025Updated last year
lmxue / NVV-SuperBench
View on GitHub
NVV-SuperBench: Beyond Words, Beyond Quality—Benchmarking Nonverbal Vocalizations in Speech Generation (Interspeech 2026 long paper)
☆18Jun 21, 2026Updated last month
sail-sg / AnytimeReasoner
View on GitHub
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
☆54Jul 15, 2025Updated last year
cheriell / ICASSP2021-A2S
View on GitHub
accompanying code for my ICASSP2021 paper
☆19Jan 6, 2022Updated 4 years ago
jekim5418 / DPM
View on GitHub
Official code for DPM : A Novel Training Method for Physics-Informed Neural Networks in Extrapolation
☆10Nov 2, 2021Updated 4 years ago
jeonchangbin49 / LimitAug
View on GitHub
☆23Aug 30, 2022Updated 3 years ago
janson9192 / autokws2021
View on GitHub
☆13Mar 25, 2021Updated 5 years ago
sail-sg / lm-random-memory-access
View on GitHub
☆15Mar 12, 2024Updated 2 years ago
jlfwong / hvac-sim-app
View on GitHub
A library for modeling loads and costs for heat pumps, furnaces, air conditioners etc
☆13Apr 30, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
js05212 / PyTorch-for-NPN
View on GitHub
Officially unofficial PyTorch code for the NIPS paper 'Natural-Parameter Networks: A Class of Probabilistic Neural Networks'
☆11Sep 28, 2021Updated 4 years ago
sail-sg / tty-use
View on GitHub
☆15Oct 13, 2025Updated 9 months ago
WING-NUS / ELCo
View on GitHub
The Dataset and Official Implementation for <The ELCo Dataset: Bridging Emoji and Lexical Composition> @ LREC-COLING 2024
☆16May 11, 2024Updated 2 years ago
albertnahas / aissist
View on GitHub
A local-first, AI-powered CLI personal assistant for tracking goals, reflections, and context
☆16Jan 9, 2026Updated 6 months ago
slliugit / slliugit.github.io
View on GitHub
music denoising network
☆16Sep 24, 2024Updated last year
isamborskiy / NUS-QE
View on GitHub
LaTeX style for NUS Qualifying Examination.
☆19Mar 13, 2018Updated 8 years ago
gabolsgabs / DALI
View on GitHub
DALI: a large Dataset of synchronised Audio, LyrIcs and vocal notes.
☆380Jun 11, 2020Updated 6 years ago