XZWY/MSLDM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/XZWY/MSLDM)

XZWY / MSLDM

Implementation of Multi-Source Music Generation with Latent Diffusion.

☆29

Alternatives and similar repositories for MSLDM

Users that are interested in MSLDM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

XZWY / SpatialCodec
View on GitHub
Implementation of SpatialCodec.
☆71Sep 23, 2023Updated 2 years ago
merlresearch / reverberation-as-supervision
View on GitHub
Enhanced Reverberation As Supervision (ERAS) for unsupervised reverberant speech separation
☆15Aug 1, 2024Updated last year
i-need-sleep / mad
View on GitHub
☆16Sep 29, 2025Updated 9 months ago
gladia-research-group / cocola
View on GitHub
☆39Jan 9, 2026Updated 6 months ago
ldzhangyx / MusicMagus
View on GitHub
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆49Sep 11, 2024Updated last year
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
madhavlab / wav2tok
View on GitHub
Codebase for ICLR' 23 paper- ''wav2tok: Deep Sequence Tokenizer for Audio Retrieval"
☆36Jun 30, 2026Updated 3 weeks ago
YoonjinXD / kadtk
View on GitHub
A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …
☆104Jun 12, 2025Updated last year
CameronChurchwell / combnet
View on GitHub
☆23Aug 4, 2025Updated 11 months ago
gladia-research-group / multi-source-diffusion-models
View on GitHub
☆171Aug 14, 2023Updated 2 years ago
chrisdonahue / fall23-phd-prospectives
View on GitHub
Info for prospective PhD students for Chris Donahue's lab at CMU starting Fall 23.
☆12Nov 13, 2022Updated 3 years ago
SonyCSLParis / codicodec
View on GitHub
Encode and decode audio samples to/from continuous and discrete compressed representations!
☆121Nov 25, 2025Updated 7 months ago
eloimoliner / BABE
View on GitHub
Zero-Shot Blind Audio Bandwidth Extension
☆27May 25, 2023Updated 3 years ago
iamycy / duet-svs-diffusion
View on GitHub
☆31Nov 5, 2023Updated 2 years ago
bernardo-torres / linear-autoencoders
View on GitHub
Official code and pretrained models for Linear Consistency Autoencoders (Lin-CAE), a method to induce linearity in audio autoencoders via…
☆17Feb 12, 2026Updated 5 months ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
KyungsuKim42 / tokensynth
View on GitHub
The official implementation of TokenSynth (ICASSP 2025)
☆91Jun 24, 2026Updated 3 weeks ago
rodrigo-castellon / jukemirlib
View on GitHub
A simple library for extracting representations from Jukebox
☆39Nov 16, 2025Updated 8 months ago
ArrayDPS / ArrayDPS
View on GitHub
☆40May 12, 2025Updated last year
rd20karim / M2T-Segmentation
View on GitHub
[NCA] Official implementation of the paper Motion2Language, Unsupervised learning of synchronized semantic motion segmentation
☆14Sep 9, 2024Updated last year
fundwotsai2001 / Text-to-Music_control_family
View on GitHub
Containing SOTA methods that follows time-varying conditions for Text-to-Music
☆24Jan 1, 2026Updated 6 months ago
haiciyang / LaDiffCodec
View on GitHub
ICASSP 2024 - Generative De-Quantization for Neural Speech Codec via Latent Diffusion.
☆56Nov 16, 2025Updated 8 months ago
yangzhao1230 / newPCMDM
View on GitHub
☆13Nov 20, 2023Updated 2 years ago
tencent-ailab / MuCodec
View on GitHub
☆168Nov 22, 2024Updated last year
ETH-DISCO / discoder
View on GitHub
Official repo for DisCoder: High-Fidelity Music Vocoder using Neural Audio Codecs presented at ICASSP 2025
☆42Feb 24, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
manoskary / weavemuse
View on GitHub
An open agentic system built on smolagents, integrating multimodal state-of-the-art music AI models for understanding, generation, and in…
☆32Feb 6, 2026Updated 5 months ago
Aratako / CALM-DACVAE
View on GitHub
An attempt to reproduce CALM (Continuous Audio Language Models) using DACVAE as the audio VAE.
☆17Feb 20, 2026Updated 5 months ago
claroche-r / FastDiffusionEM
View on GitHub
☆29Apr 23, 2024Updated 2 years ago
stet-stet / DDD
View on GitHub
code for "DDD: A Perceptually Superior Low-Response-Time DNN-Based Declipper"
☆30Apr 12, 2024Updated 2 years ago
WangHelin1997 / SoloAudio
View on GitHub
SoloAudio: Target Sound Extraction with Language-oriented Audio Diffusion Transformer.
☆119Jan 28, 2026Updated 5 months ago
jeonchangbin49 / LimitAug
View on GitHub
☆23Aug 30, 2022Updated 3 years ago
minju0821 / musical_instrument_retrieval
View on GitHub
☆29Jun 8, 2023Updated 3 years ago
HarunoriKawano / BEST-RQ
View on GitHub
Implementation of the paper "Self-supervised Learning with Random-projection Quantizer for Speech Recognition" in Pytorch.
☆96May 25, 2023Updated 3 years ago
yongyizang / MSRKit
View on GitHub
Model Implementations, Evaluation Scripts, etc. for Music Source Restoration Challenge 2025.
☆23Nov 14, 2025Updated 8 months ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
AmphionTeam / FlexiCodec
View on GitHub
[ICLR2026] FlexiCodec: A Dynamic Neural Audio Codec for Low Frame Rates
☆50Jul 1, 2026Updated 3 weeks ago
xjuspeech / YOLOPitch
View on GitHub
☆10Jun 11, 2024Updated 2 years ago
stet-stet / goct_ismir2023
View on GitHub
code for "BEAT-ALIGNED SPECTROGRAM-TO-SEQUENCE GENERATION OF RHYTHM-GAME CHARTS" (ISMIR 2023 LBD)
☆22Jan 29, 2024Updated 2 years ago
sony / soundctm
View on GitHub
Pytorch implementation of SoundCTM
☆101Mar 31, 2025Updated last year
ldzhangyx / instruct-MusicGen
View on GitHub
The official implementation of our paper "Instruct-MusicGen: Unlocking Text-to-Music Editing for Music Language Models via Instruction Tu…
☆109Jan 14, 2026Updated 6 months ago
yukara-ikemiya / minimal-sqvae
View on GitHub
A minimal Pytorch Implementation of Stochastically Quantized Variational AutoEncoder (SQ-VAE) by Sony
☆33Oct 16, 2023Updated 2 years ago
felixperfler / Stable-Hybrid-Auditory-Filterbanks
View on GitHub
[Interspeech 2024] Hold Me Tight: Stable Encoder-Decoder Design for Speech Enhancement
☆43Jul 25, 2025Updated 11 months ago