Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)
☆72Mar 11, 2025Updated 11 months ago
Alternatives and similar repositories for patch-mix_contrastive_learning
Users that are interested in patch-mix_contrastive_learning are comparing it to the libraries listed below
Sorting:
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆31Feb 4, 2024Updated 2 years ago
- (ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory So…☆17Dec 5, 2024Updated last year
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…☆19Dec 5, 2024Updated last year
- ☆19Nov 20, 2021Updated 4 years ago
- This is the official implementation of the work RespireNet.☆52Dec 8, 2020Updated 5 years ago
- (INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classificatio…☆25Jul 10, 2025Updated 7 months ago
- This repository contains the released respiratory sound database for IEEE BioCAS Respiratory Sound Track Challenges.☆62Dec 19, 2025Updated 2 months ago
- This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models☆72Mar 11, 2025Updated 11 months ago
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆13Oct 22, 2024Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Aug 29, 2024Updated last year
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆136Feb 23, 2026Updated last week
- Code for the AAAI 2022 paper "SSAST: Self-Supervised Audio Spectrogram Transformer".☆414Aug 14, 2022Updated 3 years ago
- Attention-based Hybrid CNN-LSTM and Spectral Data Augmentation for COVID-19 Diagnosis from Cough Sound☆36Aug 31, 2022Updated 3 years ago
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆91Jun 9, 2022Updated 3 years ago
- Dual Bayesian ResNet: A Deep Learning Approach to Heart Murmur Detection (Physionet Challenge 2022)☆23Oct 1, 2025Updated 5 months ago
- RespireNet is an innovative web-based application that harnesses the capabilities of deep learning and Mel-frequency cepstral coefficient…☆10Aug 2, 2023Updated 2 years ago
- COVID19 Sounds Dataset Supplementary Material☆21Oct 21, 2021Updated 4 years ago
- Keras implementation of Noise Masking RNN for respiratory sound classification☆23May 21, 2018Updated 7 years ago
- Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model☆26Apr 26, 2023Updated 2 years ago
- A lightwight Framework for the Respiratory Sound Classification☆11Feb 12, 2025Updated last year
- MusicYOLO framework uses the object detection model, YOLOx, to locate notes in the spectrogram.☆11Jan 29, 2022Updated 4 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆100Feb 20, 2026Updated last week
- This sample includes simeple CNN classifier for music and audio-folder dataloader just like ImageFolder in torchvision.☆11Oct 30, 2018Updated 7 years ago
- [AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification☆12Mar 10, 2025Updated 11 months ago
- Data repository of Project Coswara☆200Jun 23, 2023Updated 2 years ago
- ☆17Aug 9, 2024Updated last year
- ☆12May 30, 2023Updated 2 years ago
- ☆17Nov 15, 2021Updated 4 years ago
- The official code repo of "HTS-AT: A Hierarchical Token-Semantic Audio Transformer for Sound Classification and Detection"☆474Sep 18, 2025Updated 5 months ago
- EVAR ~ Evaluation package for Audio Representations☆74Feb 19, 2026Updated 2 weeks ago
- ☆68Sep 13, 2024Updated last year
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆149Jul 13, 2023Updated 2 years ago
- Contrastive language-audio pretraining for bioacoustics☆23Oct 17, 2023Updated 2 years ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆38Jul 31, 2024Updated last year
- This repository contains the code related to the paper 'DENet: a deep architecture for audio surveillance applications'.☆42Jul 23, 2023Updated 2 years ago
- Efficient Training of Audio Transformers with Patchout☆370Jan 12, 2024Updated 2 years ago
- ☆18Apr 12, 2021Updated 4 years ago
- ☆19Jul 15, 2022Updated 3 years ago
- Library of TensorFlow layers for audio data processing and data augmentation☆20Mar 5, 2022Updated 4 years ago