denfed / wave-spec-fusion
Code for the submitted 2021 DCASE Workshop paper: "Waveforms and Spectrograms: Enhancing Acoustic Scene Classification Using Multimodal Feature Fusion"
☆12 Updated 3 years ago
Alternatives and similar repositories for wave-spec-fusion:
Users who are interested in wave-spec-fusion are comparing it to the repositories listed below.
- Learning differentiable temporal resolution on time-series data. ☆35 Updated 2 years ago
- Official PyTorch Implementation for Continual Learning For On-Device Environmental Sound Classification ☆14 Updated 2 years ago
- System that ranked 2nd in DCASE 2022 Challenge Task 5: Few-shot Bioacoustic Event Detection ☆28 Updated 2 years ago
- ☆18 Updated 2 years ago
- Dynamic Mixing For Speech Processing (mix-on-the-fly) ☆17 Updated 2 years ago
- A Diffusion Probabilistic Model for Target Sound Extraction ☆36 Updated 4 months ago
- PyTorch implementation of the paper: A Global-local Attention Framework for Weakly Labelled Audio Tagging. ☆13 Updated 4 years ago
- ☆16 Updated 4 years ago
- ☆23 Updated 4 months ago
- The source code of Tim-TSENet ☆12 Updated 2 years ago
- ☆29 Updated 7 months ago
- A PyTorch implementation of the paper: SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification ☆31 Updated 3 years ago
- Boosting Self-Supervised Embeddings for Speech Enhancement ☆47 Updated 2 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation ☆23 Updated 11 months ago
- Adapting a ConvNeXt model to audio classification on AudioSet ☆23 Updated this week
- Official repo for "A MODULATION-DOMAIN LOSS FOR NEURAL-NETWORK-BASED REAL-TIME SPEECH ENHANCEMENT" to appear in ICASSP 2021 ☆38 Updated 3 years ago
- Code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆40 Updated 2 years ago
- ☆13 Updated 2 years ago
- VoViT: Low Latency Graph-based Audio-Visual Voice Separation Transformer ☆34 Updated last year
- ☆53 Updated 4 years ago
- Implementation of "A Deep Learning Loss Function based on Auditory Power Compression for Speech Enhancement" in PyTorch ☆28 Updated 3 years ago
- PyTorch implementation of LiMuSE ☆30 Updated 2 years ago
- My system for the DCASE 2022 Task 3 Sound Event Localization and Detection. ☆12 Updated 2 years ago
- Implementation of the paper: Multi-Metric Optimization using Generative Adversarial Networks for Near-End Speech Intelligibility Enhancement ☆20 Updated 3 years ago
- MSP-Podcast Challenge Baseline Code ☆20 Updated 8 months ago
- Unofficial implementation of "CPTNN: CROSS-PARALLEL TRANSFORMER NEURAL NETWORK FOR TIME-DOMAIN SPEECH ENHANCEMENT" ☆15 Updated last year
- ☆25 Updated last year
- Official implementation of EfficientLEAF, a learnable audio frontend. ☆40 Updated 2 years ago
- ☆63 Updated 5 months ago
- ☆15 Updated 2 years ago