swagshaw / Rainbow-Keywords
Rainbow Keywords - Official PyTorch Implementation
☆12Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for Rainbow-Keywords
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆31Updated 3 years ago
- ☆26Updated last year
- ☆31Updated 2 years ago
- ☆29Updated 2 years ago
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆14Updated 2 years ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆17Updated 2 years ago
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆24Updated last month
- Continual Learning Benchmark for Spoken Keyword Spotting☆15Updated 2 years ago
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆19Updated last year
- Collection of PyTorch implementations of Spoken Keyword Spotting presented in research papers.☆20Updated 7 months ago
- Multi-Head-Attention RNN pytorch implement for keyword spotting☆21Updated 4 years ago
- Learning differentiable temporal resolution on time-series data.☆32Updated 2 years ago
- ☆41Updated last year
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆38Updated 2 months ago
- Exploring Binary Classification Loss for Speaker Verification☆14Updated last year
- ☆19Updated last year
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆33Updated 2 years ago
- Test Framework for few-shot open set KWS☆24Updated this week
- Implementation of CoBERT: Self-Supervised Speech Representation Learning Through Code Representation Learning☆46Updated last year
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆13Updated 3 months ago
- LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT☆69Updated 2 years ago
- Code for the Interspeech 2024 paper "MM-KWS: Multi-modal Prompts for Multilingual User-defined Keyword Spotting"☆14Updated 2 months ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆11Updated 2 years ago
- CoNeTTE: An efficient Audio Captioning system leveraging multiple datasets with Task Embedding☆13Updated last week
- Official source code of the INTERSPEECH 2023 paper: "Audio-Visual Speech Separation in Noisy Environments with a Lightweight Iterative Mo…☆19Updated last year
- ☆18Updated 2 years ago
- Source code for ICASSP2022 "Pseudo Strong labels for large scale weakly supervised audio tagging"☆30Updated 2 years ago
- A toolkit for researchers in the multimodal sound separation.☆16Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆44Updated 2 years ago
- ☆15Updated 2 years ago