ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'
☆44Oct 31, 2022Updated 3 years ago
Alternatives and similar repositories for AVCleanse
Users that are interested in AVCleanse are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- INTERSPEECH2023: Target Active Speaker Detection with Audio-visual Cues☆58May 29, 2023Updated 2 years ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆92May 29, 2023Updated 2 years ago
- Toolkit for training and evaluating Self-Supervised Learning (SSL) frameworks for Speaker Verification (SV).☆37Feb 12, 2026Updated last month
- ☆159Jan 9, 2023Updated 3 years ago
- ☆11Nov 5, 2025Updated 4 months ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Jun 24, 2022Updated 3 years ago
- Official Repository For VoxBlink2☆86Aug 13, 2024Updated last year
- Code for "Distribution-based Emotion Recognition in Conversation"☆19Feb 6, 2023Updated 3 years ago
- Python scripts to create noisy and reverberant 2-speaker mixture audio with Libri-Light and WHAM☆17Nov 7, 2024Updated last year
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- [NeurIPS 2024] SD-Eval: A Benchmark Dataset for Spoken Dialogue Understanding Beyond Words☆56Jun 25, 2024Updated last year
- A ResNet Speaker Recognition&Verification Demo☆26Oct 19, 2021Updated 4 years ago
- ☆24Feb 20, 2024Updated 2 years ago
- Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)☆795Apr 11, 2024Updated last year
- The official pytorch implemention of the Intespeech 2024 paper "Reshape Dimensions Network for Speaker Recognition"☆186Sep 24, 2025Updated 5 months ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated 2 years ago
- VoxSRC2022 workshop development kit☆19Jul 21, 2022Updated 3 years ago
- ICASSP 2023: "Recursive Joint Attention for Audio-Visual Fusion in Regression Based Emotion Recognition"☆14Nov 29, 2024Updated last year
- [INTERSPEECH 2023 Best Paper Shortlist] Official implementation for MT4SSL: Boosting Self-Supervised Speech Representation Learning by In…☆45Mar 25, 2024Updated last year
- Exploring Binary Classification Loss for Speaker Verification☆18Jul 18, 2023Updated 2 years ago
- Learning Domain-Invariant Transformation for Speaker Verification.☆11Jun 13, 2023Updated 2 years ago
- This is the implementation of the paper "Physiological-Physical Feature Fusion for Automatic Voice Spoofing Detection"☆13Jun 5, 2023Updated 2 years ago
- ☆11May 7, 2022Updated 3 years ago
- Augmentation adversarial training for self-supervised speaker recognition☆78Aug 15, 2021Updated 4 years ago
- ☆32Jun 26, 2023Updated 2 years ago
- Official repository of NeXt-TDNN for speaker verification☆80Oct 10, 2024Updated last year
- [INTERSPEECH 2024] Official pytorch code for the paper "Disentangled Representation Learning for Environment-agnostic Speaker Recognition…☆18Jul 23, 2024Updated last year
- Pytorch implementation of Extended U-Net for Speaker Verification in Noisy Environments☆28Jul 24, 2023Updated 2 years ago
- [NeurIPS 2022] "Losses Can Be Blessings: Routing Self-Supervised Speech Representations Towards Efficient Multilingual and Multitask Spee…☆17Sep 19, 2023Updated 2 years ago
- This repo is to list the references papers of 《Speaker Recognition Based on Deep Learning: An Overview》☆41Jun 26, 2021Updated 4 years ago
- A Pytorch implementation of the paper : SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification☆34Jun 25, 2021Updated 4 years ago
- Auto-KWS 2021 Challenge 1st place solution.☆11Jul 20, 2021Updated 4 years ago
- ☆16Dec 23, 2021Updated 4 years ago
- In defence of metric learning for speaker recognition☆1,164Mar 26, 2024Updated last year
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12May 13, 2024Updated last year
- The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".☆12Oct 25, 2021Updated 4 years ago
- An Open Source Tools for Speaker Recognition☆636Aug 5, 2024Updated last year
- ☆25Jan 2, 2024Updated 2 years ago
- The code repo for ICASSP 2023 Paper "MMCosine: Multi-Modal Cosine Loss Towards Balanced Audio-Visual Fine-Grained Learning"☆25May 18, 2023Updated 2 years ago