denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆12 · Updated 3 years ago
Related projects
Alternatives and complementary repositories for wave-spec-fusion
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆27 · Updated 2 years ago
- Learning differentiable temporal resolution on time-series data. ☆33 · Updated 2 years ago
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆31 · Updated 3 years ago
- Code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆35 · Updated 2 years ago
- PyTorch implementation of the paper: A Global-local Attention Framework for Weakly Labelled Audio Tagging. ☆13 · Updated 3 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction ☆35 · Updated last month
- ☆26 · Updated last year
- ☆49 · Updated 2 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation ☆21 · Updated 8 months ago
- ☆36 · Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆45 · Updated 2 years ago
- Experiments about AudioSet ☆43 · Updated last year
- ☆13 · Updated 2 years ago
- ☆53 · Updated 4 years ago
- ICASSP 2025: Dynamic Embedding Causal Target Speech Extraction ☆29 · Updated last month
- ☆46 · Updated last year
- AudioLDM training, finetuning, evaluation and inference. ☆13 · Updated 7 months ago
- Unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT" ☆15 · Updated last year
- ☆62 · Updated 2 months ago
- A Python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult… ☆29 · Updated last month
- Code for reproducing experiments in "Domain-Adversarial Voice Activity Detection" ☆23 · Updated 4 years ago
- ☆18 · Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a… ☆40 · Updated 3 years ago
- ☆15 · Updated 2 years ago
- ☆29 · Updated 3 years ago
- ☆23 · Updated last year
- Official implementation of Efficient Speech Separation Framework Based on Neural State-Space Models ☆18 · Updated last year
- SERAB: a multi-lingual benchmark for speech emotion recognition ☆28 · Updated last year
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection ☆66 · Updated 3 years ago
- This repo provides the network code and the processed samples of the manuscript "Glance and Gaze: A Collaborative Learning Framework for … ☆65 · Updated 2 years ago