FishMaster93 / U-FFIA
The audio-visual fusion method for FFIA
☆12 Updated 6 months ago
Alternatives and similar repositories for U-FFIA:
Users who are interested in U-FFIA are comparing it to the libraries listed below.
- ☆10 Updated last year
- ☆82 Updated last year
- Learning differentiable temporal resolution on time-series data. ☆35 Updated 2 years ago
- ☆63 Updated 5 months ago
- ☆53 Updated 7 months ago
- ☆81 Updated last year
- ☆53 Updated 4 years ago
- ☆25 Updated last year
- A new comprehensive and diverse few-shot acoustic classification benchmark. ☆61 Updated 4 months ago
- Code for sound event detection transformer (SEDT) and self-supervised pre-training SEDT (SP-SEDT) ☆40 Updated 2 years ago
- Code for CVSSP submission to DCASE 2021 Task 6 ☆35 Updated 2 years ago
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023) ☆29 Updated last year
- The code for the DCASE 2021 Task 5 submission. ☆20 Updated 3 years ago
- Audio processing using a PyTorch 1D convolution network (based on nnAudio). Gammatone Spectrogram and SpecAugmentation are now available… ☆20 Updated 4 years ago
- PyTorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc… ☆24 Updated 2 years ago
- Masked Spectrogram Modeling using Masked Autoencoders for Learning General-purpose Audio Representations ☆89 Updated 8 months ago
- ☆21 Updated 4 years ago
- A PyTorch implementation of the paper: Acoustic Scene Classification with Multiple Decision Schemes. ☆20 Updated 4 years ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations ☆36 Updated last year
- Code for the TASLP paper "PSLA: Improving Audio Tagging With Pretraining, Sampling, Labeling, and Aggregation". ☆142 Updated last year
- The PyTorch implementation of the paper: Masked Spectrogram Prediction For Self-Supervised Audio Pre-Training ☆42 Updated 2 months ago
- Paderborn Sound Event Detection ☆72 Updated last year
- TDY-CNN for text-independent speaker verification ☆17 Updated 2 years ago
- ☆17 Updated 3 months ago
- ☆31 Updated 3 months ago
- Voice Face Association Learning Paper List ☆15 Updated last year
- ☆31 Updated this week
- Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection ☆16 Updated 6 months ago
- This repo includes the official implementations of "Fine-tune the pretrained ATST model for sound event detection". ☆120 Updated 4 months ago
- A PyTorch (supports batch and channel) implementation of Google Brain's SpecAugment: A Simple Data Augmentation Method for Automatic Speech… ☆11 Updated 6 months ago