DCASE2024-Task7-Sound-Scene-Synthesis/AudioLDM-training-finetuning

AudioLDM training, finetuning, evaluation and inference.

☆14

Alternatives and similar repositories for AudioLDM-training-finetuning

Users who are interested in AudioLDM-training-finetuning are comparing it to the repositories listed below.

YoonjinXD / T-FOLEY
Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…
☆34 · May 25, 2024 · Updated 2 years ago
JishengBai / ICME2024ASC
Baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift
☆18 · Mar 16, 2024 · Updated 2 years ago
christhetree / mod_discovery
Source code for "Modulation Discovery with Differentiable Digital Signal Processing".
☆15 · Mar 25, 2026 · Updated 3 months ago
CameronChurchwell / combnet
☆23 · Aug 4, 2025 · Updated 11 months ago
muuda / MUSIC-algorithm-for-circular-microphone-array
Captures audio with a single-layer circular microphone array and performs sound source localization using the MUSIC algorithm.
☆23 · Mar 16, 2023 · Updated 3 years ago
Jinbo-Hu / DCASE2022-TASK3
☆37 · Nov 14, 2024 · Updated last year
crypto-code / Music-Representation-Comparison
This is the repo with the code to conduct a comparative analysis of different audio representation models.
☆11 · Aug 31, 2023 · Updated 2 years ago
WUT-AI / hypersound
Source code for paper "Hypernetworks build Implicit Neural Representations of Sounds" from ECML 2023
☆13 · Jun 24, 2023 · Updated 3 years ago
Labbeti / aac-metrics
Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.
☆75 · Mar 22, 2026 · Updated 4 months ago
yqcai888 / DCASE2023
2022 DCASE Challenge
☆14 · Sep 30, 2024 · Updated last year
aeromamba-super-resolution / aeromamba
Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…
☆50 · Nov 11, 2025 · Updated 8 months ago
zszheng147 / Spatial-AST
🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)
☆87 · Feb 13, 2025 · Updated last year
SRPOL-AUI / spectrum-correction
Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"
☆13 · Feb 22, 2022 · Updated 4 years ago
gyx-gloria / DMT
Official Implementation of DMT: Dual Mean-Teacher in PyTorch.
☆10 · Oct 27, 2023 · Updated 2 years ago
AMAAI-Lab / JamendoMaxCaps
JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks
☆53 · May 24, 2025 · Updated last year
d-caviedes / acoustic_gps
Gaussian processes for sound field reconstruction
☆22 · Nov 5, 2020 · Updated 5 years ago
Jinbo-Hu / L3DAS22-TASK2
A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection
☆23 · Nov 14, 2024 · Updated last year
Yip-Jia-Qi / codecformer
☆21 · Jul 15, 2024 · Updated 2 years ago
felixgontier / dcase-2023-baseline
☆14 · Mar 25, 2023 · Updated 3 years ago
swagshaw / ASC-CL
Official PyTorch Implementation for Continual Learning For On-Device Environmental Sound Classification
☆14 · Jul 19, 2022 · Updated 4 years ago
XinhaoMei / DCASE2021_task6_v2
Code for CVSSP submission to DCASE 2021 Task 6
☆36 · Nov 22, 2022 · Updated 3 years ago
aspose-3d / Aspose.3D-for-Python-via-.NET
Examples of Aspose.3D for Python via .NET
☆10 · Jun 22, 2022 · Updated 4 years ago
Kinyugo / torch_mdct
A PyTorch implementation of the Modified Discrete Cosine Transform (MDCT) and its inverse for audio processing.
☆33 · Dec 17, 2024 · Updated last year
cs124 / pa0-jupyter-tutorial
☆12 · Jan 10, 2026 · Updated 6 months ago
Takaaki-Saeki / ssl_speech_restoration
SelfRemaster: SSL Speech Restoration
☆94 · Jan 5, 2024 · Updated 2 years ago
klean2050 / tiles_ecg_model
[JBHI 2024] Self-supervised pre-training on ECG collected in the wild
☆15 · Nov 14, 2023 · Updated 2 years ago
biboamy / music-repro
☆17 · Nov 7, 2023 · Updated 2 years ago
yqcai888 / easy_dcase_task1
This repository provides an easy way to train your models on the datasets of DCASE task 1.
☆20 · May 28, 2025 · Updated last year
zhshj0110 / SiT-MLP
[TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recog…
☆19 · Apr 10, 2024 · Updated 2 years ago
forgi86 / lru-reduction
Python code of the paper Model order reduction of deep structured state-space models: A system-theoretic approach
☆14 · Nov 22, 2024 · Updated last year
COM6012 / Scalable-Machine-Learning
Legacy repo - Teaching materials for module COM6012 Scalable Machine Learning, University of Sheffield
☆12 · Feb 9, 2025 · Updated last year
sp-uhh / derevdps
☆13 · Jan 12, 2024 · Updated 2 years ago
Jie-su / BPD
☆11 · Sep 20, 2025 · Updated 10 months ago
DolbyLaboratories / universal_transcoder
☆21 · Nov 12, 2025 · Updated 8 months ago
partha2409 / DCASE2024_seld_baseline
☆52 · Dec 13, 2025 · Updated 7 months ago
wsntxxn / TextToAudioGrounding
The dataset and baseline code for Text-to-Audio Grounding (TAG)
☆49 · Oct 23, 2025 · Updated 9 months ago
yoyolicoris / kazane
Simple sinc interpolation in PyTorch.
☆15 · Jul 8, 2023 · Updated 3 years ago
Andong-Li-speech / TaEr
This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…
☆14 · Nov 25, 2022 · Updated 3 years ago
Audio-Experience-Design / LAPChallenge
The LAP Challenge aims at advancing spatial audio technologies through the personalization of HRTFs.
☆16 · Aug 12, 2025 · Updated 11 months ago