YoonjinXD / kadtk
A standardized toolkit of Kernel Audio Distance (KAD)—a distribution-free, unbiased, and computationally efficient metric for evaluating generative audio.
☆67Updated last month
Alternatives and similar repositories for kadtk:
Users that are interested in kadtk are comparing it to the libraries listed below
- An invertible and differentiable implementation of the Constant-Q Transform (CQT).☆60Updated 2 years ago
- ☆76Updated last week
- [PyTorch] Minimal codebase for MusicGen models☆60Updated 3 months ago
- ☆43Updated 10 months ago
- A DDSP-based neural voice synthesiser.☆116Updated 5 months ago
- The official implementation of TokenSynth (ICASSP 2025)☆68Updated 2 weeks ago
- PAM is a no-reference audio quality metric for audio generation tasks☆60Updated 9 months ago
- Unsupervised Music Source Separation Using Differentiable Parametric Source Models☆63Updated 2 years ago
- ☆81Updated 2 years ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆31Updated 11 months ago
- (ICASSP 2025) Learning Source Disentanglement in Neural Audio Codec☆32Updated last week
- Source code for training models and using the hyperbolic interface proposed in our ICASSP 2023 paper, “Hyperbolic Audio Source Separation…☆65Updated 2 years ago
- Polyphonic generalisation of DDSP☆19Updated last year
- MelodyT5: A Unified Score-to-Score Transformer for Symbolic Music Processing [ISMIR 2024]☆41Updated 3 months ago
- Unofficial implementation of NANSY++ in Pytorch Lightning☆50Updated last year
- ☆55Updated 6 months ago
- MiRA (Music Replication Assessment) tool is a model-independent open evaluation method based on four diverse audio music similarity metri…☆34Updated 2 months ago
- Frechet Audio Distance evaluation in PyTorch☆35Updated last year
- A pretrained model for "A Phoneme-informed Neural Network Model for Note-level Singing Transcription", ICASSP 2023☆30Updated last year
- Project for MIDI to Audio Synthesis☆23Updated 2 years ago
- Source Separation training codebase for the Sound Demixing Challenge 2023.☆42Updated last year
- Inference codebase for "Cacophony: An Improved Contrastive Audio-Text Model". Preprint: https://arxiv.org/abs/2402.06986☆45Updated 6 months ago
- [ICASSP 2025] "FLowHigh: Towards efficient and high-quality audio super-resolution with single-step flow matching"☆61Updated 3 months ago
- million song dataset split for extended clean tag & artist-level stratified☆49Updated last year
- ☆52Updated last year
- The MIR-MLPop dataset and the official implementation of the paper "MIR-MLPop: A Multilingual Pop Music Dataset with Time-Aligned Lyrics …☆28Updated last year
- Differentiable dynamic range controller in PyTorch.☆48Updated 5 months ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆40Updated 5 months ago
- ☆28Updated last year
- Repository for the paper "Combining audio control and style transfer using latent diffusion", accepted at ISMIR 2024☆47Updated 2 months ago