denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆12Updated 3 years ago
Alternatives and similar repositories for wave-spec-fusion:
Users who are interested in wave-spec-fusion are comparing it to the repositories listed below
- Learning differentiable temporal resolution on time-series data.☆35Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- ☆53Updated 4 years ago
- PyTorch implementation of the paper: A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Updated 3 years ago
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- ☆18Updated 2 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆39Updated 2 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆17Updated 2 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆22Updated 10 months ago
- ☆63Updated 4 months ago
- ☆29Updated 3 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆36Updated 3 months ago
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆20Updated last month
- Code for the sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆37Updated 2 years ago
- ☆30Updated last year
- A PyTorch implementation of the paper: Acoustic Scene Classification with Multiple Decision Schemes.☆20Updated 4 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆68Updated 3 years ago
- Official implementation for our paper "Audio Mamba: Selective State Spaces for Self-Supervised Audio Representations"☆33Updated 7 months ago
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models☆18Updated last year
- PyTorch implementation of LiMuSE☆30Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆39Updated 3 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆30Updated 6 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- My system for the DCASE 2022 Task 3 Sound Event Localization and Detection.☆11Updated 2 years ago
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" in PyTorch☆28Updated 2 years ago
- ☆49Updated 2 years ago
- This package aims at simplifying the download of the AudioSet dataset.☆45Updated last year
- ☆20Updated 3 months ago