[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
☆26Aug 30, 2024Updated last year
Alternatives and similar repositories for SVT_SpeechBrain
Users that are interested in SVT_SpeechBrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code accompayning ISMIR23 paper; TriAD: Capturing harmonics with 3D convolutions☆19Jul 19, 2024Updated last year
- High-Resolution Violin Transcription using Weak Labels☆36Oct 29, 2023Updated 2 years ago
- ☆12Feb 3, 2026Updated last month
- Frechet Audio Distance evaluation in PyTorch☆36Jun 9, 2023Updated 2 years ago
- [MM 2022] MM-ALT: A Multimodal Automatic Lyric Transcription System (Oral, Top paper award)☆21Mar 16, 2024Updated 2 years ago
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Extension of Sinsy-NG using deep learning models for voice conversion in order to synthesize good and realistic vocals.☆13Aug 14, 2020Updated 5 years ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Sep 4, 2022Updated 3 years ago
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆23Apr 23, 2024Updated last year
- Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"☆18Dec 14, 2023Updated 2 years ago
- Template demonstrating how a manager may use Silver Bullet☆13Jul 7, 2023Updated 2 years ago
- ☆24Feb 20, 2024Updated 2 years ago
- ☆32Nov 25, 2023Updated 2 years ago
- ☆38Feb 1, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆14Feb 4, 2026Updated last month
- [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription☆49May 7, 2024Updated last year
- A hackathon project to explore reworking the Mattermost Plugin API.☆11Aug 22, 2023Updated 2 years ago
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆17Nov 19, 2024Updated last year
- PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model☆80Dec 6, 2023Updated 2 years ago
- Comparative Analysis of Graph Neural Networks for Node Regression task on Wiki-Squirrel dataset (Bachelor's Research Project)☆12Nov 6, 2025Updated 4 months ago
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆12Nov 30, 2021Updated 4 years ago
- ☆15Jun 13, 2024Updated last year
- Tutorial covering Open Source tools for Source Separation.☆15Nov 12, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click and start building anything your business needs.
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated 9 months ago
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆63Feb 19, 2025Updated last year
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆17Feb 16, 2023Updated 3 years ago
- A light digital audio workstation in JS☆14Jan 27, 2023Updated 3 years ago
- (Experimental) Predicting hand assignments in piano MIDI using neural networks☆13Oct 11, 2024Updated last year
- Node For Max Music experiments☆13Feb 15, 2018Updated 8 years ago
- MAPS ( MIDI Aligned Piano Sounds ) dataset python api for machine learning☆11Jun 26, 2018Updated 7 years ago
- Official repository of Fast-ULCNet.☆28Feb 4, 2026Updated last month
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆34Apr 22, 2024Updated last year
- End-to-end encrypted cloud storage - Proton Drive • AdSpecial offer: 40% Off Yearly / 80% Off First Month. Protect your most important files, photos, and documents from prying eyes.
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆76Dec 3, 2025Updated 3 months ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- ☆15Sep 24, 2022Updated 3 years ago
- This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Orab…☆11Jul 6, 2023Updated 2 years ago
- ☆12Sep 23, 2021Updated 4 years ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆37Mar 10, 2022Updated 4 years ago
- ☆21Aug 25, 2025Updated 7 months ago