vinceasvp / meta-sc
☆11 Updated last year
Alternatives and similar repositories for meta-sc:
Users who are interested in meta-sc are comparing it to the libraries listed below.
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection ☆14 Updated 4 months ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation ☆26 Updated last year
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding ☆16 Updated 3 months ago
- ☆19 Updated 3 weeks ago
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin… ☆16 Updated last year
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-… ☆18 Updated 3 years ago
- MTDA-HSED: Mutual-Assistance Tuning and Dual-Branch Aggregating for Heterogeneous Sound Event Detection ☆9 Updated 6 months ago
- Learning Domain-Invariant Transformation for Speaker Verification. ☆11 Updated last year
- Rainbow Keywords - Official PyTorch Implementation ☆12 Updated 9 months ago
- Learning differentiable temporal resolution on time-series data. ☆36 Updated 2 years ago
- ☆24 Updated 5 months ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log… ☆15 Updated 2 years ago
- [ICLR 2025] Enhancing Self-Supervised Models with Audio Mixtures for Polyphonic Soundscapes ☆18 Updated 3 weeks ago
- ☆14 Updated last year
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I… ☆17 Updated 3 months ago
- Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models ☆14 Updated 11 months ago
- ☆9 Updated last year
- [AAAI 2024] DTF-AT: Decoupled Time-Frequency Audio Transformer for Event Classification ☆10 Updated 3 weeks ago
- The code for DCASE2021 task5 submission. ☆20 Updated 3 years ago
- Noise-Aware Speech Separation with Contrastive Learning ☆17 Updated 11 months ago
- AudioLDM training, finetuning, evaluation and inference. ☆14 Updated last year
- Python implementation of the paper "Fusion of Audio and Visual Embeddings for Sound Event Localization and Detection" ☆20 Updated 11 months ago
- A python implementation of “Self-Supervised Learning of Spatial Acoustic Representation with Cross-Channel Signal Reconstruction and Mult… ☆36 Updated 5 months ago
- ☆16 Updated last month
- MFF-EINV2: Multi-scale Feature Fusion across Spectral-Spatial-Temporal Domains for Sound Event Localization and Detection ☆10 Updated 8 months ago
- This is the official repository of the papers "Parameter-Efficient Transfer Learning of Audio Spectrogram Transformers" and "Efficient Fi… ☆36 Updated 8 months ago
- ☆18 Updated 3 years ago
- ☆25 Updated 2 years ago
- This repository contains the code of the CP JKU submission to DCASE23 Task 1 "Low-complexity Acoustic Scene Classification" ☆27 Updated last year
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition" ☆19 Updated last year