vinceasvp / meta-sc
☆11Updated last year
Related projects ⓘ
Alternatives and complementary repositories for meta-sc
- This is a general framework for fake audio detection using pytorch lightning☆11Updated last month
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆15Updated 2 years ago
- Noise-Aware Speech Separation with Contrastive Learning☆16Updated 7 months ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆14Updated 2 weeks ago
- (ICASSP 2024) Official Implementation of "Stethoscope-guided Supervised Contrastive Learning for Cross-domin Adaptation on Respiratory So…☆12Updated 9 months ago
- (NeurIPS 2023 Workshop on DGM4H) Official Implementation of "Adversarial Fine-tuning using Generated Respiratory Sound to Address Class I…☆17Updated 11 months ago
- ☆14Updated last year
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆14Updated 2 years ago
- Rainbow Keywords - Official PyTorch Implementation☆12Updated 4 months ago
- Official github page of Oceanship Dataset☆14Updated 5 months ago
- ASiT: Audio Spectrogram vIsion Transformer for General Audio Representation☆21Updated 8 months ago
- Baseline method for audio-visual sound event localization and detection task of DCASE 2023 challenge☆40Updated last year
- PyTorch implementation of "Watch or Listen: Robust Audio-Visual Speech Recognition with Visual Corruption Modeling and Reliability Scorin…☆13Updated 7 months ago
- ☆12Updated 2 years ago
- Trustworthy Speech Emotion Recognition☆13Updated last year
- The code for DCASE2021 task5 submission.☆20Updated 2 years ago
- Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"☆16Updated last year
- This repository collects papers related to Speech Tokenizer.☆15Updated last month
- Cross-Speaker Encoding Network for Multi-talker Speech Recognition☆11Updated 3 months ago
- Code for paper "Cross-Modal Global Interaction and Local Alignment for Audio-Visual Speech Recognition"☆19Updated last year
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆13Updated 3 months ago
- AudioLDM training, finetuning, evaluation and inference.☆13Updated 7 months ago
- Audio-Visual Generalized Zero-Shot Learning using Large Pre-Trained Models☆13Updated 7 months ago
- Implementation for "SoundCLR: Contrastive Learning of Representations For Improved Environmental Sound Classification," in pytorch.☆24Updated 10 months ago
- ☆13Updated 2 weeks ago
- [T-IFS] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations☆12Updated 3 months ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆11Updated this week
- PyTorch implementation of "Nextformer: A ConvNeXt Augmented Conformer For End-To-End Speech Recognition"☆11Updated last year
- ☆30Updated last week
- Full-frequency dynamic convolution: a physical frequency-dependent convolution for sound event detection☆16Updated 3 months ago