denfed / wave-spec-fusionLinks
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆16Updated 4 years ago
Alternatives and similar repositories for wave-spec-fusion
Users that are interested in wave-spec-fusion are comparing it to the libraries listed below
Sorting:
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated 11 months ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- ☆95Updated 4 months ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly)☆20Updated 3 years ago
- ☆58Updated last year
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated last year
- ☆66Updated last year
- ☆38Updated 4 months ago
- ☆50Updated last year
- Discriminative Training of VBx Diarization☆26Updated 11 months ago
- ☆18Updated 3 years ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Updated last year
- MSP-Podcast Challenge Baseline Code for Interspeech 2025☆27Updated 9 months ago
- The source code of Tim-TSENet☆14Updated 3 years ago
- ☆17Updated last year
- Paderborn Sound Event Detection☆76Updated 2 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆42Updated 3 years ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult…☆37Updated 11 months ago
- ☆56Updated 2 years ago
- ☆54Updated 5 years ago
- ☆30Updated 2 years ago
- EVAR ~ Evaluation package for Audio Representations☆64Updated 3 weeks ago
- ☆65Updated 2 years ago
- Unsupervised domain adaptation for conversational speech enhancement using RemixIT☆54Updated 2 years ago
- ☆88Updated last year
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆20Updated 9 months ago
- Official data preparation and metric evaluation scripts for the Interspeech 2025 URGENT challenge.☆67Updated 3 months ago