ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆44Oct 31, 2022Updated 3 years ago
Alternatives and similar repositories for AVCleanse
Users that are interested in AVCleanse are comparing it to the libraries listed below
Sorting:
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆92May 29, 2023Updated 2 years ago
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker Verification (SV).☆37Feb 12, 2026Updated 2 weeks ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- ☆157Jan 9, 2023Updated 3 years ago
- ☆11Nov 5, 2025Updated 3 months ago
- ☆11May 7, 2022Updated 3 years ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆17Nov 7, 2024Updated last year
- A ResNet Speaker Recognition&Verification Demo☆26Oct 19, 2021Updated 4 years ago
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆45Mar 25, 2024Updated last year
- ☆16Dec 23, 2021Updated 4 years ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated last year
- Official Repository For VoxBlink2☆85Aug 13, 2024Updated last year
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆56Jun 25, 2024Updated last year
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆792Apr 11, 2024Updated last year
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- ☆24Feb 20, 2024Updated 2 years ago
- The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"☆25May 18, 2023Updated 2 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Aug 15, 2021Updated 4 years ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆186Sep 24, 2025Updated 5 months ago
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"☆13Jun 5, 2023Updated 2 years ago
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- Auto-KWS 2021 Challenge 1st place solution.☆11Jul 20, 2021Updated 4 years ago
- A library of speech gadgets.☆14Oct 15, 2022Updated 3 years ago
- Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".☆12Aug 28, 2023Updated 2 years ago
- official implementation of paper ExPO: Explainable Phonetic Trait-Oriented Network for Speaker Verification☆14Mar 14, 2025Updated 11 months ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆25Jun 22, 2022Updated 3 years ago
- Official repository of NeXt-TDNN for speaker verification☆80Oct 10, 2024Updated last year
- ☆32Jun 26, 2023Updated 2 years ago
- Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit☆1,218Feb 11, 2026Updated 2 weeks ago
- A implement of adaptive score normalization (AS-Norm) in speaker verification/recognition with pytorch☆10Oct 12, 2022Updated 3 years ago
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Jul 24, 2023Updated 2 years ago
- In defence of metric learning for speaker recognition☆1,165Mar 26, 2024Updated last year
- A curated list of audio-visual learning methods and datasets.☆286Dec 3, 2024Updated last year