vinceasvp / meta-sc
☆10Updated last year
Alternatives and similar repositories for meta-sc:
Users that are interested in meta-sc are comparing it to the libraries listed below
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆17Updated 2 years ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆12Updated 2 months ago
- (ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory So…☆13Updated last month
- Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models☆14Updated 9 months ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆10Updated last year
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…☆17Updated last month
- MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection☆9Updated 4 months ago
- MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection☆8Updated 6 months ago
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆13Updated 6 months ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆22Updated 10 months ago
- This is a general framework for fake audio detection using pytorch lightning☆15Updated 3 months ago
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆14Updated 9 months ago
- Rainbow Keywords - Official PyTorch Implementation☆12Updated 7 months ago
- Official github page of Oceanship Dataset☆19Updated 7 months ago
- This repository collects papers related to Speech Tokenizer.☆15Updated 3 months ago
- The code for DCASE2021 task5 submission.☆20Updated 2 years ago
- ☆31Updated 2 years ago
- Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection☆16Updated 5 months ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆14Updated last month
- Learning differentiable temporal resolution on time-series data.☆35Updated 2 years ago
- ☆22Updated 3 months ago
- Patch-Mix Contrastive Learning with Audio Spectrogram Transformer on Respiratory Sound Classification (INTERSPEECH 2023)☆62Updated last month
- PyTorch implementation of our work: Pretraining Respiratory Sound Representations using Metadata and Contrastive Learning (WASPAA 2023)☆28Updated 11 months ago
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆19Updated last year
- ☆14Updated last year
- ☆16Updated 2 months ago
- Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection"☆20Updated 9 months ago
- A unified framework for Low-resource Audio Processing and Evaluation (SSL Pre-training and Downstream Fine-tuning)☆27Updated 6 months ago
- Test Framework for few-shot open set KWS☆25Updated 2 months ago
- ☆12Updated 2 years ago