Examples for the ICASSP 2024 paper "StemGen: A music generation model that listens"
☆35Dec 19, 2023Updated 2 years ago
Alternatives and similar repositories for stemgen
Users who are interested in stemgen are comparing it to the libraries listed below.
- MiRA (Music Replication Assessment) tool is a model-independent open evaluation method based on four diverse audio music similarity metri…☆34Nov 14, 2025Updated 4 months ago
- Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]☆58Nov 10, 2025Updated 4 months ago
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆46Jan 23, 2025Updated last year
- "Fx-Encoder++: Extracting Instrument-wise Audio Effect Representations from Mixtures"☆49Aug 23, 2025Updated 7 months ago
- Code and MIDI demos of ISMIR 2022 paper: Domain Adversarial Training on Conditional Variational Auto-Encoder for Controllable Music Gene…☆21Mar 28, 2023Updated 3 years ago
- ☆117Updated this week
- ☆11Feb 8, 2024Updated 2 years ago
- ZIQI-Eval: A Music Evaluation Benchmark for Large Language Models☆16Jul 23, 2024Updated last year
- ☆32Jan 6, 2022Updated 4 years ago
- Official implementation of "Learning Music Audio Representations Via Weak Language Supervision" (ICASSP 2022)☆47Dec 3, 2024Updated last year
- ☆35Sep 7, 2022Updated 3 years ago
- This is the official implementation of MusER (AAAI'24).☆30Jun 4, 2025Updated 9 months ago
- My hybrid TTS network that combines VALL-E, VoiceBox, SpeechFlow, Seamless and TortoiseTTS into one☆26Aug 5, 2024Updated last year
- Compute distribution-based quality metrics for audio data using embeddings, with a focus on music.☆43Jan 15, 2026Updated 2 months ago
- Code for ICLR 2024 Paper: CompA: Addressing the Gap in Compositional Reasoning in Audio-Language Models☆22Jul 10, 2024Updated last year
- MuChoMusic is a benchmark for evaluating music understanding in multimodal audio-language models.☆44Dec 3, 2024Updated last year
- Supervised and unsupervised Concept-based explanation of pretrained music classifiers☆12Jul 27, 2023Updated 2 years ago
- The hybrid architecture is based on the idea that we could simply apply a GAN method (GANSpace) to another GAN model (GANSynth).☆25Aug 16, 2021Updated 4 years ago
- A Representation Evaluation Framework for Music Information Retrieval tasks☆53Apr 9, 2024Updated last year
- Codebase for 'A Real-Time Lyrics Alignment System Using Chroma And Phonetic Features For Classical Vocal Performance', ICASSP 2024☆13Oct 4, 2024Updated last year
- The Song Describer dataset is an evaluation dataset made of ~1.1k captions for 706 permissively licensed music recordings.☆168Dec 22, 2023Updated 2 years ago
- MusAV: a dataset of relative arousal-valence annotations for validation of audio models☆17Dec 16, 2022Updated 3 years ago
- The official implementation of TokenSynth (ICASSP 2025)☆81Oct 27, 2025Updated 5 months ago
- PyTorch implementation of an automatic music transcription method that uses a two-level hierarchical frequency-time Transformer architecture…☆112Jul 11, 2023Updated 2 years ago
- Textless Speech-to-Music Retrieval Using Emotion Similarity [ICASSP23]☆17Aug 16, 2023Updated 2 years ago
- ScorePerformer: Expressive Piano Performance Rendering with Fine-Grained Control (ISMIR 2023)☆41Mar 10, 2025Updated last year
- ☆20Feb 19, 2026Updated last month
- Dataset of dry/wet pairs for audio effects research☆39Apr 17, 2025Updated 11 months ago
- Baseline for DCASE 2024 Task 9: "Language-Queried Audio Source Separation"☆26Mar 27, 2024Updated 2 years ago
- Codebase for the paper 'EncodecMAE: Leveraging neural codecs for universal audio representation learning'☆101Jul 24, 2024Updated last year
- Multitrack music mixing style transfer given a reference song using differentiable mixing console.☆58Jul 7, 2025Updated 8 months ago
- Fast C++ implementation of ESOLA using KFRLib, can be used for online time-stretch augmentation during SpeechToText training.☆16Jul 25, 2020Updated 5 years ago
- Fine-tune your own MusicGen with LoRA☆158Apr 26, 2024Updated last year
- A simple library for Fréchet Audio Distance (FAD) calculation☆253Aug 22, 2025Updated 7 months ago
- Synthesize music in Python using any audio plugins, both realtime and offline (batch-processing)☆15Jan 7, 2023Updated 3 years ago
- Pre-training, fine-tuning, and inference code with the MAEST models for music analysis applications.☆56Jun 27, 2025Updated 9 months ago
- Prosody and Pronunciation Modification Network☆63May 5, 2025Updated 10 months ago
- ☆38Jun 16, 2024Updated last year
- A program for visualizing music theory from sheet music transcriptions.☆15Feb 1, 2026Updated last month