[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
☆27Aug 30, 2024Updated last year
Alternatives and similar repositories for SVT_SpeechBrain
Users that are interested in SVT_SpeechBrain are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code accompayning ISMIR23 paper; TriAD: Capturing harmonics with 3D convolutions☆19Jul 19, 2024Updated last year
- High-Resolution Violin Transcription using Weak Labels☆37Oct 29, 2023Updated 2 years ago
- ☆13Feb 3, 2026Updated 2 months ago
- Frechet Audio Distance evaluation in PyTorch☆36Jun 9, 2023Updated 2 years ago
- [MM 2022] MM-ALT: A Multimodal Automatic Lyric Transcription System (Oral, Top paper award)☆21Mar 16, 2024Updated 2 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Unconditional music synthesis using a diffusion model in the STFT domain☆12May 31, 2022Updated 3 years ago
- Extension of Sinsy-NG using deep learning models for voice conversion in order to synthesize good and realistic vocals.☆13Aug 14, 2020Updated 5 years ago
- This repo contains the source code of the first deep learning-base singing voice beat tracking system. It leverages WavLM and DistilHuBER…☆33Sep 4, 2022Updated 3 years ago
- Supplementary Materials of ISMIR 2022 paper "Analysis and detection of singing techniques in repertoires of J-POP solo singers" by Yuya Y…☆23Apr 23, 2024Updated last year
- Source code of paper "Adapting pretrained speech model for Mandarin lyrics transcription and alignment"☆18Dec 14, 2023Updated 2 years ago
- Template demonstrating how a manager may use Silver Bullet☆13Jul 7, 2023Updated 2 years ago
- ☆24Feb 20, 2024Updated 2 years ago
- ☆32Nov 25, 2023Updated 2 years ago
- ☆39Feb 1, 2024Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆14Feb 4, 2026Updated 2 months ago
- [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription☆49May 7, 2024Updated last year
- A hackathon project to explore reworking the Mattermost Plugin API.☆11Aug 22, 2023Updated 2 years ago
- PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model☆80Dec 6, 2023Updated 2 years ago
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆17Nov 19, 2024Updated last year
- Comparative Analysis of Graph Neural Networks for Node Regression task on Wiki-Squirrel dataset (Bachelor's Research Project)☆13Nov 6, 2025Updated 5 months ago
- ☆15Jun 13, 2024Updated last year
- This is the official implementation of our neural-network-based fast diffuse room impulse response generator (FAST-RIR) for generating r…☆12Nov 30, 2021Updated 4 years ago
- Tutorial covering Open Source tools for Source Separation.☆15Nov 12, 2021Updated 4 years ago
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- [CVPR 2025] Official implementation of paper "Prosody-Enhanced Acoustic Pre-training and Acoustic-Disentangled Prosody Adapting for Movie…☆23Jun 6, 2025Updated 10 months ago
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆65Feb 19, 2025Updated last year
- A light digital audio workstation in JS☆14Jan 27, 2023Updated 3 years ago
- Code for "A diffusion-inspired training strategy for singing voice extraction in the waveform domain" (ISMIR 2022)☆17Feb 16, 2023Updated 3 years ago
- (Experimental) Predicting hand assignments in piano MIDI using neural networks☆13Oct 11, 2024Updated last year
- Node For Max Music experiments☆13Feb 15, 2018Updated 8 years ago
- MAPS ( MIDI Aligned Piano Sounds ) dataset python api for machine learning☆11Jun 26, 2018Updated 7 years ago
- Official repository of Fast-ULCNet.☆29Feb 4, 2026Updated 2 months ago
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆35Apr 22, 2024Updated last year
- Simple, predictable pricing with DigitalOcean hosting • AdAlways know what you'll pay with monthly caps and flat pricing. Enterprise-grade infrastructure trusted by 600k+ customers.
- Implementation of Acoustic BPE (Shen et al., 2024), extended for RVQ-based Neural Audio Codecs☆76Dec 3, 2025Updated 4 months ago
- ☆15Sep 24, 2022Updated 3 years ago
- wake-up word emotion recognition [APSIPA 2022]☆17Nov 11, 2022Updated 3 years ago
- This repository provides the materials used in "Unsupervised Melody-to-Lyric Generation" by Yufei Tian, Anjali Narayan-Chen, Shereen Orab…☆11Jul 6, 2023Updated 2 years ago
- ☆12Sep 23, 2021Updated 4 years ago
- 🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production☆37Mar 10, 2022Updated 4 years ago
- Deep learning for automatic mixing☆30Aug 29, 2024Updated last year