SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning
☆53Jul 28, 2025Updated 9 months ago
Alternatives and similar repositories for SonicVerse
Users that are interested in SonicVerse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Improving Symbolic Music Generation with Inference-Time Alignment☆22Aug 2, 2025Updated 9 months ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆51May 24, 2025Updated 11 months ago
- SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering☆176May 2, 2026Updated 3 weeks ago
- Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…☆39Oct 26, 2025Updated 6 months ago
- ☆11Nov 14, 2024Updated last year
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bit…☆12Jul 29, 2025Updated 9 months ago
- Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""☆15Jun 28, 2024Updated last year
- A large-scale dataset of caption-annotated MIDI files.☆84Jul 23, 2024Updated last year
- ☆25Jun 19, 2025Updated 11 months ago
- [EMNLP 2025 Findings] Official code for EZ-VC: Easy Zero-shot Any-to-Any Voice Conversion☆38Sep 9, 2025Updated 8 months ago
- Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…☆17Sep 25, 2025Updated 7 months ago
- Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…☆43Sep 11, 2024Updated last year
- "Enhancing Neural Audio Fingerprint Robustness to Audio Degradation for Music Identification" ISMIR2025☆36Sep 11, 2025Updated 8 months ago
- Variable Bitrate Residual Vector Quantization for Audio Coding☆52May 1, 2025Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.☆91Jun 19, 2025Updated 11 months ago
- Music Information Retrieval Feature Library for Extraction☆45Nov 14, 2024Updated last year
- ☆32Nov 25, 2023Updated 2 years ago
- Beat annotations for the beat tracker Beat This!☆13Mar 2, 2026Updated 2 months ago
- A pitch detection model trained to be robust against noise and reverberation environments.☆27Jan 21, 2025Updated last year
- Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)☆103May 12, 2026Updated last week
- ☆15Sep 20, 2023Updated 2 years ago
- Tool to aid in the creation of mashups☆19Apr 7, 2020Updated 6 years ago
- Pytorch project accompanying the paper "Comparing Deep Models and Evaluation Strategies for Multi-Pitch Estimation in Music Recordings", …☆13Aug 26, 2022Updated 3 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Joint Embedding Predictive Architecture for Musical Stem Compatibility Estimation☆54Aug 6, 2024Updated last year
- Estimating musical surprisal/information content in Audio☆28Apr 9, 2026Updated last month
- The source code for the paper XiaoiceSing2 (interspeech2023)☆49Jan 15, 2024Updated 2 years ago
- Code for ChordSync, a conformer-based audio-to-chord synchroniser☆14Oct 17, 2025Updated 7 months ago
- Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language mode…☆164Feb 28, 2025Updated last year
- A python algorithm to change the pitch of the voice in real time☆13Dec 13, 2020Updated 5 years ago
- ☆19Feb 2, 2023Updated 3 years ago
- An open agentic system built on smolagents, integrating multimodal state-of-the-art music AI models for understanding, generation, and in…☆29Feb 6, 2026Updated 3 months ago
- Lyrics and Vocal Melody Generation conditioned on Accompaniment☆28Aug 27, 2022Updated 3 years ago
- End-to-end encrypted email - Proton Mail • AdSpecial offer: 40% Off Yearly / 80% Off First Month. All Proton services are open source and independently audited for security.
- A dataset of 173 progressive metal songs, in both GuitarPro and token formats, as per the specifications in DadaGP.☆18Nov 19, 2024Updated last year
- This repository contains a series of works on diffusion-based speech tokenizers, including the official implementation of the paper: "TaD…☆77Jan 25, 2026Updated 3 months ago
- Additional material for the paper ADTOF: A large dataset of non-synthetic music for automatic drum transcription☆77Sep 18, 2025Updated 8 months ago
- Fine-tune Stable Audio Open with DiT ControlNet.☆253May 16, 2025Updated last year
- Encode and decode audio samples to/from compressed latent representations!☆256Sep 19, 2025Updated 8 months ago
- JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment☆162Aug 7, 2025Updated 9 months ago
- Perceived Music Quality Dataset☆12Jul 1, 2024Updated last year