denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆16Updated 4 years ago
Alternatives and similar repositories for wave-spec-fusion
Users who are interested in wave-spec-fusion often compare it to the repositories listed below
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated last year
- System that ranked 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- ☆18Updated 3 years ago
- ☆96Updated 4 months ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated last year
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Updated last year
- Paderborn Sound Event Detection☆76Updated 2 years ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆72Updated 4 months ago
- ☆38Updated 4 months ago
- ☆30Updated 2 years ago
- ☆54Updated 5 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆73Updated 4 years ago
- ☆58Updated last year
- Unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT"☆15Updated last year
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆42Updated 3 years ago
- Discriminative Training of VBx Diarization☆26Updated last year
- EVAR ~ Evaluation package for Audio Representations☆65Updated 2 weeks ago
- A PyTorch implementation of the paper "SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification"☆33Updated 4 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆20Updated 3 years ago
- PyTorch implementation of the LEAF audio frontend☆73Updated 2 years ago
- ☆65Updated 2 years ago
- Code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45Updated 3 years ago
- ☆57Updated 2 years ago
- MultiSV: scripts for data preparation☆27Updated 8 months ago
- MANNER: Multi-view Attention Network for Noise ERasure (Speech enhancement in time-domain)☆63Updated 3 years ago
- ☆50Updated last year
- Official implementation of "Dual-stream Time-Delay Neural Network with Dynamic Global Filter for Speaker Verification" in PyTorch☆41Updated 2 years ago
- ☆58Updated last year