☆18Aug 16, 2025Updated 6 months ago
Alternatives and similar repositories for dcase2025_task4_baseline
Users that are interested in dcase2025_task4_baseline are comparing it to the libraries listed below
Sorting:
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification"☆30Sep 18, 2023Updated 2 years ago
- 6 DoF Directional Room Impulse Response (RIR) with Dense Loudspeaker Grid☆17Aug 31, 2023Updated 2 years ago
- Sound Event Detection (SED) paper collection☆17Jun 26, 2024Updated last year
- ☆22Mar 19, 2025Updated 11 months ago
- Improving Recording Device Generalization using Impulse Response Augmentation☆20Apr 24, 2025Updated 10 months ago
- ☆28Oct 17, 2024Updated last year
- Query-conditioned target sound extraction model☆30Mar 25, 2025Updated 11 months ago
- Prediction of sound event bounding boxes (SEBBs)☆32Aug 2, 2024Updated last year
- This repository aims to collect Transformer-based sound event detection (SED) algorithms.☆93Feb 10, 2026Updated 3 weeks ago
- Code for the paper "Self-Supervised Learning for Anomalous Sound Detection"☆41May 13, 2024Updated last year
- Official Pytorch implementation of PULSE: Positive–Unlabelled Learning for audio Signal Enhancement (Best Paper Award at ICASSP 2023)☆43Jul 24, 2023Updated 2 years ago
- Official implementation of Bayes Conditional Distribution Estimation for Knowledge Distillation Based on Conditional Mutual Information☆11Sep 28, 2023Updated 2 years ago
- ☆16Jun 12, 2025Updated 8 months ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Sep 27, 2024Updated last year
- Survey of audio language models☆62Feb 4, 2026Updated 3 weeks ago
- ☆43Feb 21, 2023Updated 3 years ago
- ☆10Oct 16, 2025Updated 4 months ago
- Official implementation of "Modeling Multi-Task Model Merging as Adaptive Projective Gradient Descent".☆21May 23, 2025Updated 9 months ago
- Diffusion-based Speech Enhancement: Demonstration of Performance and Generalization☆12Dec 21, 2024Updated last year
- text-only training or language-free training for multimodal tasks (image/audio/video caption, retrieval, text2image)☆12Oct 15, 2024Updated last year
- ☆10May 16, 2024Updated last year
- This is a Dockerfile to use stable_diffusion.openvino in Docker container.☆13Aug 29, 2022Updated 3 years ago
- 日本音響学会誌用BibTeXスタイルファイル☆11Jan 24, 2022Updated 4 years ago
- Onset-and-Offset-Aware Sound Event Detection☆21Feb 10, 2025Updated last year
- Code and data recipes for the paper: Optimal Condition Training for Target Source Separation by Efthymios Tzinis, Gordon Wichern, Paris S…☆14Feb 15, 2023Updated 3 years ago
- ☆207Dec 5, 2024Updated last year
- ☆13Aug 13, 2023Updated 2 years ago
- ☆13Jul 10, 2021Updated 4 years ago
- Spatial Voice Conversion: Voice Conversion Preserving Spatial Information and Non-target Signals☆18Aug 8, 2024Updated last year
- Data generator for stereo sound event localization and detection task of DCASE 2025 challenge☆14Jul 17, 2025Updated 7 months ago
- AV-GS: Learning Material and Geometry Aware Priors for Novel View Acoustic Synthesis☆12Oct 3, 2024Updated last year
- ☆11Feb 14, 2025Updated last year
- Enhanced sound event localization and detection in real 360-degree audio-visual soundscapes (DCASE task3 format)☆13Mar 21, 2025Updated 11 months ago
- PyTorch implementation of Swin Transformer for 1-dimensional data☆17Mar 15, 2024Updated last year
- ☆12Nov 1, 2024Updated last year
- ☆16Jan 11, 2026Updated last month
- ☆11Mar 5, 2024Updated last year
- ☆13Feb 4, 2025Updated last year
- ☆13Mar 11, 2025Updated 11 months ago