swagshaw / Rainbow-Keywords
Rainbow Keywords - Official PyTorch Implementation
☆12Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for Rainbow-Keywords
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆17Updated 2 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆14Updated 2 years ago
- ☆31Updated 2 years ago
- ☆26Updated last year
- Exploring Binary Classification Loss for Speaker Verification☆14Updated last year
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆16Updated 3 months ago
- ☆29Updated 2 years ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆11Updated this week
- Learning Domain-Invariant Transformation for Speaker Verification.☆9Updated last year
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆20Updated 7 months ago
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆19Updated last year
- ☆15Updated 2 years ago
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆24Updated 2 months ago
- SASV2 baseline, a track on ASVspoof5 phase2 challenge☆22Updated 4 months ago
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆13Updated 3 months ago
- ☆13Updated 4 months ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆14Updated 2 weeks ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆38Updated 2 months ago
- ☆17Updated last month
- Test Framework for few-shot open set KWS☆25Updated 2 weeks ago
- ☆13Updated 2 weeks ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆35Updated 2 years ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆12Updated last year
- Contains code for Deep Self Supervised Heirarchical Clustering for Speaker Diarization☆16Updated 2 years ago
- Code for paper "Gradient Remedy for Multi-Task Learning in End-to-End Noise-Robust Speech Recognition"☆16Updated last year
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆11Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆33Updated 2 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- ☆14Updated last year