SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
☆52Jul 28, 2025Updated 8 months ago
Alternatives and similar repositories for SonicVerse
Users that are interested in SonicVerse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Improving Symbolic Music Generation with Inference-Time Alignment☆20Aug 2, 2025Updated 8 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆48May 24, 2025Updated 10 months ago
- SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering☆162Aug 25, 2025Updated 7 months ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆38Oct 26, 2025Updated 5 months ago
- ☆12Nov 14, 2024Updated last year
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bit…☆12Jul 29, 2025Updated 8 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- A large-scale dataset of caption-annotated MIDI files.☆79Jul 23, 2024Updated last year
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆36Sep 9, 2025Updated 7 months ago
- ☆25Jun 19, 2025Updated 9 months ago
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…☆43Sep 11, 2024Updated last year
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆17Sep 25, 2025Updated 6 months ago
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆35Sep 11, 2025Updated 7 months ago
- Music Information Retrieval Feature Library for Extraction☆40Nov 14, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Variable Bitrate Residual Vector Quantization for Audio Coding☆50May 1, 2025Updated 11 months ago
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆83Jun 19, 2025Updated 9 months ago
- ☆32Nov 25, 2023Updated 2 years ago
- Beat annotations for the beat tracker Beat This!☆13Mar 2, 2026Updated last month
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)☆99Dec 23, 2025Updated 3 months ago
- ☆15Sep 20, 2023Updated 2 years ago
- Tool to aid in the creation of mashups☆19Apr 7, 2020Updated 6 years ago
- Pytorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", …☆13Aug 26, 2022Updated 3 years ago
- NordVPN Special Discount Offer • AdSave on top-rated NordVPN 1 or 2-year plans with secure browsing, privacy protection, and support for for all major platforms.
- Estimating musical surprisal/information content in Audio☆24Mar 23, 2026Updated 2 weeks ago
- Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation☆52Aug 6, 2024Updated last year
- The source code for the paper XiaoiceSing2 (interspeech2023)☆49Jan 15, 2024Updated 2 years ago
- Code for ChordSync, a conformer-based audio-to-chord synchroniser☆14Oct 17, 2025Updated 5 months ago
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language mode…☆160Feb 28, 2025Updated last year
- ☆19Feb 2, 2023Updated 3 years ago
- Lyrics and Vocal Melody Generation conditioned on Accompaniment☆28Aug 27, 2022Updated 3 years ago
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆17Nov 19, 2024Updated last year
- Wordpress hosting with auto-scaling on Cloudways • AdFully Managed hosting built for WordPress-powered businesses that need reliable, auto-scalable hosting. Cloudways SafeUpdates now available.
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆76Jan 25, 2026Updated 2 months ago
- Additional material for the paper ADTOF: A large dataset of non-synthetic music for automatic drum transcription☆69Sep 18, 2025Updated 6 months ago
- Fine-tune Stable Audio Open with DiT ControlNet.☆249May 16, 2025Updated 10 months ago
- Encode and decode audio samples to/from compressed latent representations!☆251Sep 19, 2025Updated 6 months ago
- JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment☆156Aug 7, 2025Updated 8 months ago
- Perceived Music Quality Dataset☆12Jul 1, 2024Updated last year
- Elucidated Text-To-Audio (ETTA) is a SOTA text-to-audio model with a holistic understanding of the design space and trained with syntheti…☆114Mar 3, 2026Updated last month