AMAAI-Lab/SonicVerse

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/AMAAI-Lab/SonicVerse)

AMAAI-Lab / SonicVerse

SonicVerse: Multi-Task Learning for Music Feature-Informed Captioning

☆53

Alternatives and similar repositories for SonicVerse

Users that are interested in SonicVerse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

AMAAI-Lab / t2m-inferalign
View on GitHub
Improving Symbolic Music Generation with Inference-Time Alignment
☆22Aug 2, 2025Updated 11 months ago
AMAAI-Lab / SonicMaster
View on GitHub
SonicMaster: Towards Controllable All-in-One Music Restoration and Mastering
☆189Jun 5, 2026Updated last month
AMAAI-Lab / MelodySim
View on GitHub
MelodySim: Measuring Melody-aware Music Similarity for Plagiarism Detection
☆29May 29, 2025Updated last year
declare-lab / nora-1.5
View on GitHub
NORA-1.5: A Vision-Language-Action Model Trained using World Model- and Action-based Preference Rewards
☆109Jan 11, 2026Updated 6 months ago
AMAAI-Lab / MidiCaps
View on GitHub
A large-scale dataset of caption-annotated MIDI files.
☆85Jul 23, 2024Updated 2 years ago
Serverless GPU API endpoints on Runpod - Get Bonus Credits • Ad
Skip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
AMAAI-Lab / megamusicaps
View on GitHub
☆11Nov 14, 2024Updated last year
AMAAI-Lab / mirflex
View on GitHub
Music Information Retrieval Feature Library for Extraction
☆57Nov 14, 2024Updated last year
AMAAI-Lab / JamendoMaxCaps
View on GitHub
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
☆53May 24, 2025Updated last year
AMAAI-Lab / nnAudio2
View on GitHub
GPU-based audio processing. Trainable Fourier kernels.
☆35Jun 8, 2026Updated last month
OpenMOSS / MOSS-Music
View on GitHub
MOSS-Music is an open-source music understanding model for targeting musical captioning, lyrics ASR, structural analysis, chord / key / t…
☆121May 9, 2026Updated 2 months ago
AMAAI-Lab / PreBit
View on GitHub
This is the repo accompanying the paper: "A multimodal model with Twitter FinBERT embeddings for extreme price movement prediction of Bit…
☆12Jul 29, 2025Updated 11 months ago
AMAAI-Lab / Text2midi
View on GitHub
Text2midi is the first end-to-end model for generating MIDI files from textual descriptions. By leveraging pretrained large language mode…
☆174Feb 28, 2025Updated last year
declare-lab / jamify
View on GitHub
JAM: A Tiny Flow-based Song Generator with Fine-grained Controllability and Aesthetic Alignment
☆167Aug 7, 2025Updated 11 months ago
guozixunnicolas / FundamentalMusicEmbedding
View on GitHub
☆32Nov 25, 2023Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
jhuang448 / MultilingualALT
View on GitHub
Repo of the paper "Towards Building an End-to-End Multilingual Automatic Lyrics Transcription Model""
☆15Jun 28, 2024Updated 2 years ago
AMAAI-Lab / DART
View on GitHub
Demo for DART, Audio Imagination workshop submission in NeurIPS 2024
☆16Apr 22, 2026Updated 3 months ago
loubbrad / aria-midi
View on GitHub
Official repository for Aria-MIDI: a MIDI dataset of 1,186,253 transcribed solo-piano recordings.
☆98Jun 19, 2025Updated last year
migperfer / AutoMashupper
View on GitHub
Tool to aid in the creation of mashups
☆21Apr 7, 2020Updated 6 years ago
0417keito / JEN-1-COMPOSER-pytorch
View on GitHub
Unofficial implementation JEN-1 Composer: A Unified Framework for High-Fidelity Multi-Track Music Generation(https://arxiv.org/abs/2310.1…
☆32Jan 19, 2024Updated 2 years ago
EmilianPostolache / stable-audio-controlnet
View on GitHub
Fine-tune Stable Audio Open with DiT ControlNet.
☆256May 16, 2025Updated last year
SonyCSLParis / audioic
View on GitHub
Estimating musical surprisal/information content in Audio
☆34Apr 9, 2026Updated 3 months ago
smulelabs / windowed-roformer
View on GitHub
Official Repository for "Efficient Vocal Source Separation Through Windowed RoFormer"
☆45Oct 30, 2025Updated 8 months ago
tencent-ailab / MuQ
View on GitHub
Official repository of the paper "MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization".
☆357Aug 4, 2025Updated 11 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
WildHoneyPie / BEAST
View on GitHub
Codes for ICASSP 2024 paper: BEAST: Online Joint Beat and Downbeat Tracking Based on Streaming Transformer. An online beat tracking syste…
☆44Sep 11, 2024Updated last year
fundwotsai2001 / Text-to-Music_control_family
View on GitHub
Containing SOTA methods that follows time-varying conditions for Text-to-Music
☆24Jan 1, 2026Updated 6 months ago
chenjianyi / fastsag
View on GitHub
FastSAG: Towards Fast Non-Autoregressive Singing Accompaniment Generation
☆29Dec 19, 2024Updated last year
Pliploop / GDRetriever
View on GitHub
Official implementation of the paper - GD-Retriever: Controllable generative text-music retrieval with diffusion models (Accepted at ISMI…
☆19Sep 25, 2025Updated 9 months ago
SonyCSLParis / music2latent
View on GitHub
Encode and decode audio samples to/from compressed latent representations!
☆267Sep 19, 2025Updated 10 months ago
LAION-AI / scaled-echo-tts
View on GitHub
Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024…
☆24Mar 29, 2026Updated 3 months ago
HanxunH / AudioMosaic
View on GitHub
[ICML2026] AudioMosaic: Contrastive Masked Audio Representation Learning
☆23May 15, 2026Updated 2 months ago
yukara-ikemiya / minimal-musicgen-for-developers
View on GitHub
[PyTorch] Minimal codebase for MusicGen models
☆63Jan 7, 2025Updated last year
WikiChao / ZeroSep
View on GitHub
[NeurIPS 2025] Separate Anything in Audio with Zero Training
☆60Nov 3, 2025Updated 8 months ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
EleutherAI / aria
View on GitHub
Official repository for the paper: Scaling Self-Supervised Representation Learning for Symbolic Piano Performance (ISMIR 2025)
☆108May 12, 2026Updated 2 months ago
astradzhao / music-rfm
View on GitHub
Open Source code for our paper, Steering Autoregressive Music Generation with Recursive Feature Machines (Zhao et al., 2025). aka MusicRF…
☆40Oct 26, 2025Updated 8 months ago
SonyResearch / VRVQ
View on GitHub
Variable Bitrate Residual Vector Quantization for Audio Coding
☆54May 1, 2025Updated last year
fundwotsai2001 / MuseControlLite
View on GitHub
MuseControlLite: Multifunctional Music Generation with Lightweight Conditioners [ICML 2025]
☆67Jan 6, 2026Updated 6 months ago
CPJKU / beat_this_annotations
View on GitHub
Beat annotations for the beat tracker Beat This!
☆14Mar 2, 2026Updated 4 months ago
yhj137 / PianistTransformer
View on GitHub
This is the official implementation for the paper "Pianist Transformer: Towards Expressive Piano Performance Rendering via Scalable Self-…
☆45Jun 25, 2026Updated 3 weeks ago
deezer / skey
View on GitHub
Self-supervised key estimation model that matches performance with supervised state-of-the-art model.
☆63Jun 9, 2025Updated last year