kaen2891 / bts
(INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification"
☆16Updated 3 months ago
Alternatives and similar repositories for bts:
Users who are interested in bts are comparing it to the repositories listed below:
- (ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domain Adaptation on Respiratory So…☆13Updated 3 months ago
- This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models☆49Updated 3 months ago
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆29Updated last year
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆64Updated 3 months ago
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…☆17Updated 3 months ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆23Updated 11 months ago
- ☆81Updated last year
- Trustworthy Speech Emotion Recognition☆13Updated last year
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 7 months ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆36Updated last year
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆61Updated 5 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆86Updated 7 months ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆16Updated 2 years ago
- ☆26Updated 2 years ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆142Updated last year
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆58Updated 8 months ago
- ☆12Updated last year
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆89Updated 8 months ago
- ☆53Updated 4 years ago
- The PyTorch implementation of the paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training☆42Updated 2 months ago
- ☆62Updated 5 months ago
- ☆24Updated 2 years ago
- ☆18Updated 3 years ago
- Submission for task 2 "First-Shot Unsupervised Anomalous Sound Detection for Machine Condition Monitoring" of the DCASE challenge 2023 (h…☆13Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆75Updated 4 years ago
- Code for the sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT)☆40Updated 2 years ago
- The code for DCASE2021 task5 submission.☆20Updated 3 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- ☆30Updated last year
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆129Updated 2 years ago