guxm2021/SVT_SpeechBrain

guxm2021 / SVT_SpeechBrain

[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing

☆28

Alternatives and similar repositories for SVT_SpeechBrain

Users who are interested in SVT_SpeechBrain are comparing it to the libraries listed below.

migperfer / TriAD-ISMIR2023
Code accompanying ISMIR23 paper; TriAD: Capturing harmonics with 3D convolutions
☆20 · Jul 19, 2024 · Updated 2 years ago
hanshounsu / d3rm
☆14 · Feb 3, 2026 · Updated 5 months ago
drscotthawley / fad_pytorch
Frechet Audio Distance evaluation in PyTorch
☆36 · Jun 9, 2023 · Updated 3 years ago
eloimoliner / unconditional-diff-STFT
Unconditional music synthesis using a diffusion model in the STFT domain
☆12 · May 31, 2022 · Updated 4 years ago
guxm2021 / MM_ALT
[MM 2022] MM-ALT: A Multimodal Automatic Lyric Transcription System (Oral, Top paper award)
☆21 · Mar 16, 2024 · Updated 2 years ago
MTG / violin-transcription
High-Resolution Violin Transcription using Weak Labels
☆41 · Oct 29, 2023 · Updated 2 years ago
mathigatti / DeepSingingSynthesizer
Extension of Sinsy-NG using deep learning models for voice conversion in order to synthesize good and realistic vocals.
☆13 · Aug 14, 2020 · Updated 5 years ago
mjhydri / Singing-Vocal-Beat-Tracking
This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBER…
☆35 · Sep 4, 2022 · Updated 3 years ago
yamathcy / ISMIR2022J-POP
Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…
☆23 · Apr 23, 2024 · Updated 2 years ago
navi0105 / LyricAlignment
Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"
☆19 · Dec 14, 2023 · Updated 2 years ago
silverbulletmd / silverbullet-manager-space-template
Template demonstrating how a manager may use Silver Bullet
☆13 · Jul 7, 2023 · Updated 3 years ago
guozixunnicolas / FundamentalMusicEmbedding
☆32 · Nov 25, 2023 · Updated 2 years ago
DanielMengLiu / AudioVisualLip
☆25 · Feb 20, 2024 · Updated 2 years ago
joanne-b-nortier / UDiffSE
☆41 · Feb 1, 2024 · Updated 2 years ago
otnemrasordep / ProgGP
A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.
☆18 · Nov 19, 2024 · Updated last year
mattermost / mattermost-plugin-api
A hackathon project to explore reworking the Mattermost Plugin API.
☆11 · Aug 22, 2023 · Updated 2 years ago
sony / DiffRoll
PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model
☆81 · Dec 6, 2023 · Updated 2 years ago
guxm2021 / ALT_SpeechBrain
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
☆51 · May 7, 2024 · Updated 2 years ago
mattermost-community / mattermost-plugin-webex
☆15 · Jul 16, 2026 · Updated last week
EvelynZhou / FAST-RIR
This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…
☆12 · Nov 30, 2021 · Updated 4 years ago
mdx-tutorial / mdx-tutorial.github.io
Tutorial covering Open Source tools for Source Separation.
☆15 · Nov 12, 2021 · Updated 4 years ago
zamirmehdi / GNN-Node-Regression
Comparative Analysis of Graph Neural Networks for Node Regression task on Wiki-Squirrel dataset (Bachelor's Research Project)
☆13 · Nov 6, 2025 · Updated 8 months ago
ZZDoog / ProDubber
[CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…
☆23 · Jun 6, 2025 · Updated last year
NieeiM / Dasheng-Audiogen
Generate a complete audio clip with music, intelligible speech, and sound effects from text in one pass.
☆44 · May 27, 2026 · Updated last month
qchenevier / scribbleton-live
A light digital audio workstation in JS
☆14 · Jan 27, 2023 · Updated 3 years ago
JuliaMusic / PianoHands.jl
(Experimental) Predicting hand assignments in piano MIDI using neural networks
☆13 · Oct 11, 2024 · Updated last year
genisplaja / diffusion-vocal-sep
Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)
☆17 · Feb 16, 2023 · Updated 3 years ago
Cycling74 / node-music-theory
Node For Max Music experiments
☆13 · Feb 15, 2018 · Updated 8 years ago
york135 / MIRMLPop
The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …
☆35 · Apr 22, 2024 · Updated 2 years ago
morganmcg1 / wandb_spectrogram
☆15 · Sep 24, 2022 · Updated 3 years ago
JackJamesLoth / GOAT-Dataset
A Large Dataset of Paired Guitar Audio Recordings and Tablatures
☆25 · Sep 30, 2025 · Updated 9 months ago
seungheondoh / hi_kia
wake-up word emotion recognition [APSIPA 2022]
☆17 · Nov 11, 2022 · Updated 3 years ago
exeex / maps-dataset
MAPS (MIDI Aligned Piano Sounds) dataset Python API for machine learning
☆11 · Jun 26, 2018 · Updated 8 years ago
AbrahamSanders / codec-bpe
Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs
☆76 · Dec 3, 2025 · Updated 7 months ago
amazon-science / unsupervised-melody-to-lyrics-generation
This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Orab…
☆11 · Jul 6, 2023 · Updated 3 years ago
dl4am / tutorial
Deep learning for automatic mixing
☆32 · Aug 29, 2024 · Updated last year
Edresson / Coqui-TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
☆37 · Mar 10, 2022 · Updated 4 years ago
PRamoneda / RL_PianoFingering
☆13 · Sep 23, 2021 · Updated 4 years ago
stanstan324234 / gosync
Go-style channel and WaitGroup for JS to handle task queues.
☆17 · Mar 26, 2025 · Updated last year