sony / DiffRoll
PyTorch implementation of DiffRoll, a diffusion-based generative automatic music transcription (AMT) model
☆79 · Updated last year
Alternatives and similar repositories for DiffRoll
Users who are interested in DiffRoll are comparing it to the repositories listed below.
- ☆85 · Updated 2 years ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023) ☆40 · Updated 8 months ago
- Official Implementation of Jointist ☆37 · Updated 2 years ago
- ☆29 · Updated 2 years ago
- million song dataset split for extended clean tag & artist-level stratified ☆52 · Updated 2 years ago
- Polyffusion: A Diffusion Model for Polyphonic Score Generation with Internal and External Controls ☆83 · Updated last year
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models ☆63 · Updated 2 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023. ☆44 · Updated 2 years ago
- A piano music dataset with Audio, Symbolic and Text labels ☆33 · Updated 8 months ago
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024 ☆55 · Updated 8 months ago
- ReconVAT: a semi-supervised automatic music transcription (AMT) model ☆38 · Updated last year
- ☆39 · Updated 3 years ago
- Official repository for the paper - SLAP: Siamese Language-Audio Pretraining without negative samples for Music Understanding ☆51 · Updated last month
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating … ☆89 · Updated 5 months ago
- MR-MT3: Memory Retaining Multi-Track Music Transcription to Mitigate Instrument Leakage ☆49 · Updated 4 months ago
- ☆33 · Updated last year
- jazznet dataset of piano patterns for music audio machine learning research ☆79 · Updated 2 years ago
- Project for MIDI to Audio Synthesis ☆25 · Updated 2 years ago
- Full models and training code for PESTO ☆71 · Updated last year
- ☆38 · Updated 2 years ago
- [PyTorch] Minimal codebase for MusicGen models ☆63 · Updated 10 months ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022) ☆47 · Updated 11 months ago
- The source code and pre-trained model of the paper "On the Preparation and Validation of a Large-scale Dataset" ☆60 · Updated 3 years ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks ☆44 · Updated 5 months ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT). ☆65 · Updated 2 years ago
- Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025) ☆81 · Updated last month
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024] ☆44 · Updated 9 months ago
- list of MIR dataset papers presented at ISMIR 2022 ☆61 · Updated 2 years ago
- This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBER… ☆33 · Updated 3 years ago
- Deep Performer: Score-to-audio music performance synthesis ☆44 · Updated 2 years ago