AudioLDM training, finetuning, evaluation and inference.
☆14Mar 27, 2024Updated 2 years ago
Alternatives and similar repositories for AudioLDM-training-finetuning
Users that are interested in AudioLDM-training-finetuning are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆23Aug 4, 2025Updated 9 months ago
- Implementation of the paper, T-FOLEY: A Controllable Waveform-Domain Diffusion Model for Temporal-Event-Guided Foley Sound Synthesis, ac…☆34May 25, 2024Updated 2 years ago
- baseline for IEEE ICME 2024 GC: Semi-supervised Acoustic Scene Classification under Domain Shift☆18Mar 16, 2024Updated 2 years ago
- A multizone sound field control method to synthesize a desired amplitude (or magnitude) distributions over a target region with multiple …☆15Mar 30, 2023Updated 3 years ago
- 通过单层圆形麦克风阵列采集音频,实现MUSIC算法的声源定位。☆23Mar 16, 2023Updated 3 years ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Metrics for evaluating Automated Audio Captioning systems, designed for PyTorch.☆72Mar 22, 2026Updated 2 months ago
- ☆37Nov 14, 2024Updated last year
- 🦇 Encoder of BAT (Learning to Reason about Spatial Sounds with Large Language Models)☆84Feb 13, 2025Updated last year
- This is the repo with the code to conduct a comparative analysis of different audio representation models.☆12Aug 31, 2023Updated 2 years ago
- Source code for paper "Hypernetworks build Implicit Neural Representations of Sounds" from ECML 2023☆13Jun 24, 2023Updated 2 years ago
- ☆17Nov 7, 2023Updated 2 years ago
- 2022 DCASE Challenge☆14Sep 30, 2024Updated last year
- Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"☆13Feb 22, 2022Updated 4 years ago
- Official implementation of "AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and sta…☆50Nov 11, 2025Updated 6 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implementation of DMT: Dual Mean-Teacher in PyTorch.☆10Oct 27, 2023Updated 2 years ago
- JamendoMaxCaps is a large-scale dataset of 362,000 instrumental creative commons tracks☆51May 24, 2025Updated last year
- Gaussian processes for sound field reconstruction☆22Nov 5, 2020Updated 5 years ago
- ☆13Dec 12, 2025Updated 5 months ago
- ☆50Dec 13, 2025Updated 5 months ago
- A Track-Wise Ensemble Event Independent Network for 3D Polyphonic Sound Event Localization and Detection☆23Nov 14, 2024Updated last year
- ☆21Jul 15, 2024Updated last year
- ☆14Mar 25, 2023Updated 3 years ago
- Code for CVSSP submission to DCASE 2021 Task 6☆36Nov 22, 2022Updated 3 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Jul 19, 2022Updated 3 years ago
- [TCSVT 2024] Implementation of the paper "SiT-MLP: A Simple MLP with Point-wise Topology Feature Learning for Skeleton-based Action Recog…☆19Apr 10, 2024Updated 2 years ago
- Examples of Aspose.3D for Python via .NET☆10Jun 22, 2022Updated 3 years ago
- SelfRemaster: SSL Speech Restoration☆94Jan 5, 2024Updated 2 years ago
- ☆12Jan 10, 2026Updated 4 months ago
- [JBHI 2024] Self-supervised pre-training on ECG collected in the wild☆15Nov 14, 2023Updated 2 years ago
- Unofficial Implementation of "Liu, W., Li, A., Wang, X., Yuan, M., Chen, Y., Zheng, C., & Li, X. (2022). A Neural Beamspace-Domain Filter…☆19Oct 21, 2022Updated 3 years ago
- A spoken version of the textual story cloze benchmark☆22Aug 6, 2023Updated 2 years ago
- ☆13Jan 12, 2024Updated 2 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- Useful tips and tricks to start a repository for a project☆16Dec 13, 2023Updated 2 years ago
- This is the repo of our work titled “Detect All-Type Deepfake Audio: Wavelet Prompt Tuning for Enhanced Auditory Perception”☆33Mar 31, 2026Updated last month
- ☆40Nov 18, 2025Updated 6 months ago
- ☆11Sep 20, 2025Updated 8 months ago
- The dataset and baseline code for Text-to-Audio Grounding (TAG)☆50Oct 23, 2025Updated 7 months ago
- ☆17Sep 19, 2023Updated 2 years ago
- This is the implementation of the manuscript "Learning General All-Neural Speech Enhancement based on Taylor's Approximation Theory", whi…☆14Nov 25, 2022Updated 3 years ago