Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, and sound event detection. Implemented using PyTorch.
☆44Nov 10, 2021Updated 4 years ago
Alternatives and similar repositories for DcaseNet
Users that are interested in DcaseNet are comparing it to the libraries listed below
Sorting:
- CP-JKU submission to DCASE 20☆45Apr 19, 2021Updated 4 years ago
- Baseline method for sound event localization task of DCASE 2021 challenge☆42Jun 15, 2021Updated 4 years ago
- ☆18Nov 10, 2019Updated 6 years ago
- Source code for publication: "Spectrum Correction: Acoustic Scene Classification with Mismatched Recording Devices"☆13Feb 22, 2022Updated 4 years ago
- DCASE2020 Challenge Task 2 baseline system☆115Dec 27, 2022Updated 3 years ago
- DCASE2020 Challenge Task 1 baseline system☆25Jun 22, 2020Updated 5 years ago
- Paderborn Sound Event Detection☆79Jul 18, 2023Updated 2 years ago
- Code for DCASE 2020 task 1a and task 1b.☆88Jan 20, 2022Updated 4 years ago
- Tracking beer/wine using Audio Event Detection with Machine Learning☆15Jun 16, 2024Updated last year
- Official Repository for "Training-Free Multi-Step Audio Source Separation"☆54May 26, 2025Updated 9 months ago
- Reading list for research topics in Sound AI☆196Aug 8, 2024Updated last year
- Comparing Audio Features for Unsupervised Sound Classification☆10Jun 22, 2022Updated 3 years ago
- Baseline for the Spoofing-aware Speaker Verification Challenge 2022☆66May 3, 2022Updated 3 years ago
- Sound event detection with depthwise separable and dilated convolutions.☆52Mar 30, 2020Updated 5 years ago
- In this work we propose two postprocessing approaches applying convolutional neural networks (CNNs) either in the time domain or the ceps…☆28Mar 8, 2020Updated 6 years ago
- ☆13Jan 2, 2025Updated last year
- ☆19Jul 15, 2022Updated 3 years ago
- Visualization toolbox for Sound Event Detection☆123Feb 26, 2024Updated 2 years ago
- Multi-Agent Reinforcement Learning (MARL) method to learn scalable control polices for multi-agent target tracking (IROS22).☆11Jul 22, 2022Updated 3 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆17Nov 9, 2022Updated 3 years ago
- ☆16Jun 15, 2021Updated 4 years ago
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆17Dec 16, 2021Updated 4 years ago
- Inspired by the convolutional recurrent neural network(CRNN) and inception, we propose a multiscale time-frequency convolutional recurren…☆23Apr 15, 2020Updated 5 years ago
- Unsupervised Domain Adaptation for Acoustic Scene Classification with Wasserstein Distance☆14Sep 16, 2020Updated 5 years ago
- ☆20May 13, 2019Updated 6 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- A collection of utilities for Detection and Classification of Acoustic Scenes and Events☆134Apr 3, 2025Updated 11 months ago
- Filter Bank Implementaion as Convolutional Neural Network using Python Keras☆17Dec 18, 2024Updated last year
- PyTorch Implementation of SubSpectralNet - Using Sub-Spectrogram based Convolutional Neural Networks for Acoustic Scene Classification, a…☆22Feb 20, 2019Updated 7 years ago
- Echo aware source separation☆13May 29, 2018Updated 7 years ago
- Python toolkit for likelihood-ratio calibration of binary classifiers☆25Feb 21, 2023Updated 3 years ago
- Surrey CVSSP DCASE 2018 Task 2 system☆20Dec 26, 2022Updated 3 years ago
- ☆23Jan 1, 2021Updated 5 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45May 9, 2022Updated 3 years ago
- A survey on MM-LLMs for long video understanding: From Seconds to Hours: Reviewing MultiModal Large Language Models on Comprehensive Long…☆18Sep 12, 2025Updated 6 months ago
- An implementation of the Prism layer (https://arxiv.org/abs/2011.04823)☆12Nov 13, 2020Updated 5 years ago
- Epoch-synchronous overlap-add (ESOLA) for time-and pitch-scale modification of speech signals.☆22Jul 24, 2020Updated 5 years ago
- 2022 DCASE Challenge☆14Sep 30, 2024Updated last year
- ☆27Jan 17, 2024Updated 2 years ago