swagshaw / Rainbow-KeywordsLinks
Rainbow Keywords - Official PyTorch Implementation
☆13Updated 11 months ago
Alternatives and similar repositories for Rainbow-Keywords
Users that are interested in Rainbow-Keywords are comparing it to the libraries listed below
Sorting:
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Updated 3 years ago
- ☆30Updated last year
- ☆18Updated last month
- Exploring Binary Classification Loss for Speaker Verification☆16Updated last year
- This is a public repository for RATS Channel-A Speech Data, which is a chargeable noisy speech dataset under LDC. Here we release its Log…☆15Updated 2 years ago
- WildDESED: A LLM-Powered Dataset for Wild Domestic Environment Sound Event Detection☆15Updated 6 months ago
- Test-time adaptation for speech recognition model by single utterance. The official implementation of "Listen, Adapt, Better WER: Source-…☆19Updated 3 years ago
- ☆29Updated 2 years ago
- ☆30Updated 6 months ago
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆23Updated last year
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆17Updated 2 years ago
- ☆32Updated 2 years ago
- Learning differentiable temporal resolution on time-series data.☆36Updated 2 years ago
- Official PyTorch implementation of "t-EER: Parameter-Free Tandem Evaluation Metric of Countermeasures and Biometric Comparators"☆12Updated last year
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Updated last year
- Boosting Self-Supervised Embeddings for Speech Enhancement☆47Updated 2 years ago
- (Interspeech 2023 & ICASSP 2024) Official repository for ARMHuBERT and STaRHuBERT☆40Updated 9 months ago
- ☆10Updated 2 years ago
- ☆15Updated 2 years ago
- ☆13Updated 2 years ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆16Updated 7 months ago
- ☆43Updated 2 years ago
- Vox-Profile Benchmark☆25Updated 2 weeks ago
- The source code of Tim-TSENet☆12Updated 3 years ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Updated last year
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆13Updated last month
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆12Updated 7 months ago
- Code for paper "Unifying Speech Enhancement and Separation with Gradient Modulation for End-to-End Noise-Robust Speech Separation"☆42Updated 10 months ago
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆11Updated 2 years ago
- ☆32Updated 2 years ago