denfed / wave-spec-fusionLinks
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆16Updated 4 years ago
Alternatives and similar repositories for wave-spec-fusion
Users that are interested in wave-spec-fusion are comparing it to the libraries listed below
Sorting:
- A Diffusion Probabilistic Model for Target Sound Extraction☆40Updated last year
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- ☆99Updated 5 months ago
- Paderborn Sound Event Detection☆76Updated 2 years ago
- EVAR ~ Evaluation package for Audio Representations☆65Updated 2 weeks ago
- System that ranks 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection☆28Updated 3 years ago
- Official implementation of EfficientLEAF, a learnable audio frontend.☆47Updated 2 years ago
- ☆18Updated 3 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 3 years ago
- ☆38Updated 5 months ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆65Updated last year
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆28Updated last year
- A library built for easier audio self-supervised training, downstream tasks evaluation☆131Updated last month
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆33Updated 4 years ago
- SLT 2024 Mandarin Stuttering Event Detection and Automatic Speech Recognition Challenge☆12Updated last year
- A pytorch implementation of the paper : Acoustic Scene Classification with Multiple Decision Schemes.☆20Updated 4 years ago
- Author's repository for reproducing DcaseNet, an integrated pre-trained DNN that performs acoustic scene classification, audio tagging, a…☆42Updated 3 years ago
- Official repository for the paper "Audio xLSTMs: Learning Self-supervised audio representations with xLSTMs"☆21Updated last month
- ☆66Updated last year
- Discriminative Training of VBx Diarization☆26Updated last year
- Data generator for creating synthetic audio mixtures suitable for DCASE Challenge 2022 Task 3☆40Updated 2 years ago
- Pytorch port of Google Research's LEAF Audio paper☆93Updated 4 years ago
- PyTorch implementation of the LEAF audio frontend☆75Updated 2 years ago
- ☆54Updated 5 years ago
- code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆45Updated 3 years ago
- SERAB: a multi-lingual benchmark for speech emotion recognition☆28Updated 2 years ago
- ☆23Updated last year
- experiments about AudioSet☆44Updated 2 years ago
- The source code of Tim-TSENet☆14Updated 3 years ago
- ☆58Updated 2 years ago