ispamm / Stable-V2A

Stable-V2A: Synthesis of Synchronized Sound Effect with Temporal and Semantic Controls

☆13

Alternatives and similar repositories for Stable-V2A:

Users that are interested in Stable-V2A are comparing it to the libraries listed below

YoonjinXD / kadtk
A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating …
☆63Updated 2 weeks ago
mcomunita / syncfusion
SyncFusion: Multimodal Onset-synchronized Video-to-Audio Foley Synthesis
☆15Updated 8 months ago
gladia-research-group / cocola
☆24Updated last month
XZWY / MSLDM
Implementation of Multi-Source Music Generation with Latent Diffusion.
☆22Updated 6 months ago
sony / diffusion-timbre-transfer
☆40Updated 4 months ago
keshavbhandari / yinyang
☆11Updated last month
anton-jeran / MULTI-AUDIODEC
This is the official implementation of our multi-channel multi-speaker multi-spatial neural audio codec architecture.
☆47Updated 2 weeks ago
NilsDem / control-transfer-diffusion
Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024
☆45Updated last month
ldzhangyx / MusicMagus
The official implementation of the IJCAI 2024 paper "MusicMagus: Zero-Shot Text-to-Music Editing via Diffusion Models".
☆40Updated 6 months ago
naba89 / iSeparate-SDX
iSeparate library for the SDX2023 challenge
☆13Updated last year
haoheliu / SemantiCodec
☆43Updated 9 months ago
yukara-ikemiya / minimal-musicgen-for-developers
[PyTorch] Minimal codebase for MusicGen models
☆58Updated 2 months ago
KyungsuKim42 / tokensynth
The official implementation of TokenSynth (ICASSP 2025)
☆51Updated 3 weeks ago
eloimoliner / audio-inpainting-diffusion
☆64Updated 11 months ago
schufo / umss
Unsupervised Music Source Separation Using Differentiable Parametric Source Models
☆62Updated 2 years ago
jorshi / drumblender
Synthesis of percussion sounds using sinusoidal modelling, DDSP noise synthesis, and a neural source filter approach.
☆30Updated 2 months ago
DiffAPF / LA-2A
Feed-forward compressor experiments source code for "Differentiable All-pole Filters for Time-varying Audio Systems".
☆18Updated 9 months ago
iamycy / duet-svs-diffusion
☆30Updated last year
koichi-saito-sony / ismir2024_tutorial_demo
☆16Updated 4 months ago
gzhu06 / Cacophony
Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986
☆44Updated 5 months ago
seungheondoh / music_caps_dl
Unofficial download repository for MusicCaps
☆46Updated last year
TeeJayBaker / PolyDDSP
Polyphonic generalisation of DDSP
☆18Updated 11 months ago
YoonjinXD / T-FOLEY
Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…
☆28Updated 10 months ago
kwatcharasupat / source-separation-landing
Landing Page for All Things Source Separation
☆24Updated 4 months ago
hyakuchiki / SSSSM-DDSP
Repository for Semi-supervised Synthesizer Sound Matching with Differentiable DSP
☆21Updated 2 years ago
XiaoyuBIE1994 / SDCodec
(ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec
☆29Updated 3 months ago
fundwotsai2001 / AP-adapter
Audio Prompt Adapter: Unleashing music editing abilities for text-to-music with lightweight finetuning [ISMIR 2024]
☆50Updated 5 months ago
xmusic-project / XMIDI_Dataset
XMIDI Dataset: A large-scale symbolic music dataset with emotion and genre labels.
☆18Updated 2 months ago
archinetai / cqt-pytorch
An invertible and differentiable implementation of the Constant-Q Transform (CQT).
☆58Updated 2 years ago
SamsungLabs / Undiff
Test code disclosure for the research paper "UnDiff: Unsupervised Voice Restoration with Unconditional Diffusion Model", as a supplementa…
☆20Updated last year