SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
☆50Jul 28, 2025Updated 7 months ago
Alternatives and similar repositories for SonicVerse
Users that are interested in SonicVerse are comparing it to the libraries listed below
Sorting:
- Improving Symbolic Music Generation with Inference-Time Alignment☆20Aug 2, 2025Updated 7 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆47May 24, 2025Updated 9 months ago
- SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering☆158Aug 25, 2025Updated 6 months ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆38Oct 26, 2025Updated 4 months ago
- ☆12Nov 14, 2024Updated last year
- This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bit…☆12Jul 29, 2025Updated 7 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- A large-scale dataset of caption-annotated MIDI files.☆79Jul 23, 2024Updated last year
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆36Sep 9, 2025Updated 6 months ago
- ☆25Jun 19, 2025Updated 9 months ago
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…☆42Sep 11, 2024Updated last year
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆17Sep 25, 2025Updated 5 months ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆35Sep 11, 2025Updated 6 months ago
- Music Information Retrieval Feature Library for Extraction☆35Nov 14, 2024Updated last year
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 10 months ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆78Jun 19, 2025Updated 9 months ago
- ☆32Nov 25, 2023Updated 2 years ago
- Beat annotations for the beat tracker Beat This!☆13Mar 2, 2026Updated 3 weeks ago
- Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)☆95Dec 23, 2025Updated 3 months ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- ☆15Sep 20, 2023Updated 2 years ago
- Tool to aid in the creation of mashups☆19Apr 7, 2020Updated 5 years ago
- Pytorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", …☆13Aug 26, 2022Updated 3 years ago
- Estimating musical surprisal/information content in Audio☆23Jan 19, 2026Updated 2 months ago
- Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation☆51Aug 6, 2024Updated last year
- The source code for the paper XiaoiceSing2 (interspeech2023)☆49Jan 15, 2024Updated 2 years ago
- Code for ChordSync, a conformer-based audio-to-chord synchroniser☆13Oct 17, 2025Updated 5 months ago
- Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language mode…☆155Feb 28, 2025Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- Lyrics and Vocal Melody Generation conditioned on Accompaniment☆29Aug 27, 2022Updated 3 years ago
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆17Nov 19, 2024Updated last year
- Official implementation of WildFX Dataset Generating pipeline.☆15Oct 21, 2025Updated 5 months ago
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated last month
- Additional material for the paper ADTOF: A large dataset of non-synthetic music for automatic drum transcription☆69Sep 18, 2025Updated 6 months ago
- Fine-tune Stable Audio Open with DiT ControlNet.☆249May 16, 2025Updated 10 months ago
- Encode and decode audio samples to/from compressed latent representations!☆251Sep 19, 2025Updated 6 months ago
- JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment☆155Aug 7, 2025Updated 7 months ago
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆106Mar 3, 2026Updated 2 weeks ago