kaen2891 / btsLinks
(INTERSPEECH 2024) Official Implementation of "BTS: Bridging Text and Sound Modalities for Metadata-Aided Respiratory Sound Classification"
☆21Updated last month
Alternatives and similar repositories for bts
Users that are interested in bts are comparing it to the libraries listed below
Sorting:
- This is the official code release for OPERA: OPEn Respiratory Acoustic foundation models☆57Updated 2 months ago
- (ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory So…☆15Updated 6 months ago
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆30Updated last year
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆67Updated 2 months ago
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…☆18Updated 6 months ago
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation".☆144Updated last year
- ☆88Updated last year
- SpecAugment: A Simple Data Augmentation Method for Automatic Speech Recognition☆80Updated 4 years ago
- This is the official implementation of the work RespireNet.☆47Updated 4 years ago
- ☆26Updated 2 years ago
- Masked Modeling Duo: Towards a Universal Audio Pre-training Framework☆101Updated 10 months ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations☆91Updated 11 months ago
- PEFT-SER: On the Use of Parameter Efficient Transfer Learning Approaches For Speech Emotion Recognition Using Pre-trained Speech Models (…☆59Updated 11 months ago
- Heart Murmur Detection from Phonocardiogram Recordings: The George B. Moody PhysioNet Challenge 2022☆11Updated last month
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆26Updated last year
- Public Code for the paper MAE-AST: Masked Autoencoding Audio Spectrogram Transformer☆87Updated 2 years ago
- Official implement of SpeechFormer written in Python (PyTorch).☆80Updated 2 years ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection".☆131Updated this week
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi…☆36Updated 10 months ago
- SpeechFormer++ in PyTorch☆48Updated last year
- ☆18Updated 3 years ago
- ☆65Updated 8 months ago
- ☆18Updated 4 years ago
- ☆12Updated 4 years ago
- Trustworthy Speech Emotion Recognition☆13Updated 2 years ago
- Emotion Recognition ToolKit (ERTK): tools for emotion recognition. Dataset processing, feature extraction, experiments,☆58Updated 6 months ago
- Dual Bayesian ResNet: A Deep Learning Approach to Heart Murmur Detection (Physionet Challenge 2022)☆19Updated last year
- [IJCAI 2024] EAT: Self-Supervised Pre-Training with Efficient Audio Transformer☆153Updated last month
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆37Updated last year
- ☆12Updated 4 years ago