kaen2891 / bts
(INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification"
☆19Updated 4 months ago
Alternatives and similar repositories for bts:
Users that are interested in bts are comparing it to the libraries listed below
- (ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory So…☆13Updated 4 months ago
- This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models☆56Updated last month
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆29Updated last year
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆64Updated last month
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…☆17Updated 4 months ago
- ☆18Updated 3 years ago
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆77Updated 4 years ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆26Updated last year
- Pytorch implementation of the paper : Modeling Label Dependencies for Audio Tagging with Graph Convolutional Network☆13Updated 4 years ago
- Code repo for "Multi-Task Learning for Interpretable Weakly Labelled Sound Event Detection"☆16Updated 2 years ago
- ☆62Updated 6 months ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆90Updated 9 months ago
- A new comprehensive and diverse few-shot acoustic classification benchmark.☆62Updated 6 months ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆91Updated 8 months ago
- Dataset and baseline code for the VocalSound dataset (ICASSP2022).☆133Updated 2 years ago
- ☆41Updated 4 years ago
- ☆12Updated last year
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Official Pytorch Implementation for Continual Learning For On-Device Environmental Sound Classification☆14Updated 2 years ago
- ☆53Updated 4 years ago
- ☆26Updated 2 years ago
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆59Updated 9 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆143Updated last year
- ☆26Updated 2 years ago
- Repository for the paper "Towards duration robust weakly supervised sound event detection"☆23Updated last year
- ☆12Updated 2 years ago
- ☆86Updated last year
- Code for paper "Dual-Path Style Learning for End-to-End Noise-Robust Speech Recognition"☆41Updated last year
- 2022 DCASE Challenge☆12Updated 6 months ago
- Implementation of the paper "Attentive Statistics Pooling for Deep Speaker Embedding" in Pytorch☆43Updated 4 years ago