denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆12Updated 3 years ago
Alternatives and similar repositories for wave-spec-fusion:
Users that are interested in wave-spec-fusion are comparing it to the libraries listed below
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- ☆18Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆2Updated 2 weeks ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆18Updated 2 years ago
- Audio activity detector based on per-channel energy normalization (PCEN)☆29Updated 6 years ago
- The source code of Tim-TSENet☆12Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆36Updated 6 months ago
- ☆36Updated last week
- ☆13Updated 2 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆41Updated 2 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆26Updated last year
- ☆48Updated 2 years ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆47Updated 2 months ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆40Updated 3 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 2 years ago
- Pytorch implementation of the paper : A Global-local Attention Framework for Weakly Labelled Audio Tagging.☆13Updated 4 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated 9 months ago
- ☆29Updated 2 years ago
- Pytorch implementation of subband decomposition☆92Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆40Updated 2 years ago
- Paderborn Sound Event Detection☆73Updated last year
- acnn for text-independent speaker recognition☆9Updated 3 years ago
- ☆30Updated 8 months ago
- Self-supervised Speech Enhancement network☆11Updated 4 years ago
- ☆31Updated 2 years ago
- PyTorch implementation of LiMuSE☆30Updated 2 years ago