denfed / wave-spec-fusionLinks
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆16Updated 4 years ago
Alternatives and similar repositories for wave-spec-fusion
Users that are interested in wave-spec-fusion are comparing it to the libraries listed below
Sorting:
- A Diffusion Probabilistic Model for Target Sound Extraction☆41Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 3 years ago
- ☆59Updated last year
- ☆103Updated 6 months ago
- ☆54Updated 5 years ago
- Paderborn Sound Event Detection☆76Updated 2 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated last year
- ☆18Updated 3 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3☆40Updated 2 years ago
- ☆30Updated 4 years ago
- An Improved Event-Independent Network for Polyphonic Sound Event Localization and Detection☆73Updated 4 years ago
- Pytorch implementation of subband decomposition☆92Updated 3 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- EVAR ~ Evaluation package for Audio Representations☆68Updated last month
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆75Updated 5 months ago
- ☆38Updated 6 months ago
- ☆66Updated 2 years ago
- ☆58Updated 2 years ago
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Updated 4 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆21Updated 3 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆55Updated 2 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆32Updated 11 months ago
- ☆89Updated last year
- Pytorch port of Google Research's LEAF Audio paper☆93Updated 4 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆39Updated last year
- Domestic environment sound event detection task☆150Updated last year
- Data simulation scripts for paper "Target Sound Extraction with Variable Cross-modality Clues"☆16Updated 2 years ago