denfed / wave-spec-fusionLinks
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆15Updated 3 years ago
Alternatives and similar repositories for wave-spec-fusion
Users that are interested in wave-spec-fusion are comparing it to the libraries listed below
Sorting:
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction☆38Updated 8 months ago
- My system for the DCASE 2022 Task 3 Sound Event Localizaiton and Detection.☆12Updated 2 years ago
- ☆18Updated 2 years ago
- ☆65Updated 8 months ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 2 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆41Updated 2 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆13Updated 11 months ago
- ☆33Updated 3 weeks ago
- ☆53Updated 5 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆72Updated 3 years ago
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Updated 4 years ago
- 2022 DCASE Challenge☆12Updated 8 months ago
- ☆50Updated 3 weeks ago
- ICASSP2025Dynamic Embedding Causal Target Speech Extraction☆3Updated 2 months ago
- ☆24Updated 7 months ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 2 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆44Updated 3 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆37Updated 7 months ago
- ☆50Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Updated 3 years ago
- unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year
- Official code of "DeFT-AN: Dense Frequency-Time Attentive Network for Multichannel Speech Enhancement, IEEE Signal Processing Letters, 20…☆28Updated 7 months ago
- Understanding Audio Features via Trainable Basis Functions☆9Updated 3 years ago
- ☆31Updated 6 months ago
- ☆19Updated last year
- The source code of Tim-TSENet☆12Updated 3 years ago
- ☆26Updated last year
- Code for the paper: "Leveraging speaker attribute information using multi task learning for speaker verification and diarization" present…☆25Updated 2 years ago