itec-hust / MusicYOLOLinks
MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.
☆15Updated 3 years ago
Alternatives and similar repositories for MusicYOLO
Users that are interested in MusicYOLO are comparing it to the libraries listed below
Sorting:
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆38Updated 2 years ago
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆66Updated 2 years ago
- Repository for ISMIR 2022 tutorial T3(M): Designing Controllable Synthesis System for Musical Signals☆28Updated 3 years ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆28Updated 2 months ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- ☆32Updated last year
- TAPE: An End-to-End Timbre-Aware Pitch Estimator☆23Updated 2 years ago
- ☆60Updated 2 years ago
- A PyTorch Implementation of the paper - Choi, Woosung, et al. "Investigating u-nets with various intermediate blocks for spectrogram-base…☆78Updated 3 years ago
- The official implementation of "TONet: Tone-Octave Network for Singing Melody Extraction from Polyphonic Music"☆42Updated 3 years ago
- The ArtificialSongGenerator automatically composes and compiles the Artifical Audio Multitrack dataset (AAM).☆26Updated 2 weeks ago
- Demucs Lightning: A PyTorch lightning version of Demucs with Hydra and Tensorboard features☆85Updated 2 years ago
- Code for ISMIR 2020 paper: "Multiple F0 Estimation in Vocal Ensembles using Convolutional Neural Networks"☆56Updated last year
- Unofficial implementation of SpecTNT in pytorch☆50Updated 3 years ago
- Chorale Music Separation Dataset and Model Framework☆40Updated 3 years ago
- ☆18Updated 6 years ago
- Fully-Convolutional Network for Pitch Estimation of Speech Signals☆59Updated 2 years ago
- A repo that builds text to music datasets from scratch, used in MuseContorlLite [ICML2025]☆27Updated 6 months ago
- A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …☆91Updated 5 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆44Updated 6 months ago
- ☆17Updated 4 years ago
- ☆12Updated 2 years ago
- Using Word embeddings for automatic EQ mixing☆13Updated 3 years ago
- Utilities for interfacing with Slakh2100☆71Updated last year
- Realization for note segmentation by using hierarchical objective function☆14Updated 6 years ago
- Vocal melody extraction using patch-based CNN☆32Updated 7 years ago
- [ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription☆48Updated last year
- Training, validation, and inference code for various SSL approaches and architectures.☆70Updated last month
- [ismir2019] Learning a Joint Embedding Space of Monophonic and Mixed Music Signals for Singing Voice☆28Updated 2 years ago
- Pitch-shifting and time-stretching with TD-PSOLA☆86Updated 2 years ago