my-yy / vfal_papers
Voice Face Association Learning Paper List
☆15Updated last year
Alternatives and similar repositories for vfal_papers:
Users that are interested in vfal_papers are comparing it to the libraries listed below
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆18Updated 6 months ago
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆38Updated 2 years ago
- ☆13Updated 6 months ago
- Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verificat…☆16Updated 10 months ago
- ☆17Updated 10 months ago
- ☆13Updated 2 years ago
- ☆144Updated 2 years ago
- ☆27Updated last year
- This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio".☆47Updated last month
- [ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization☆21Updated last month
- ☆32Updated 2 months ago
- Pytorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc…☆23Updated 2 years ago
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆35Updated last month
- A Compact and Effective Pretrained Model for Speech Emotion Recognition☆32Updated 7 months ago
- [ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations☆36Updated last year
- This is a general framework for fake audio detection using pytorch lightning☆15Updated 3 months ago
- Official implement of SpeechFormer written in Python (PyTorch).☆77Updated last year
- ☆43Updated 6 months ago
- Pytorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro…☆23Updated last year
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition.☆37Updated 4 years ago
- ☆16Updated 2 months ago
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning'☆89Updated last year
- Baseline system for CNVSRC2023 (Chinese Continuous Visual Speech Recognition Challenge 2023)☆21Updated 9 months ago
- ☆15Updated 2 months ago
- Baseline system for SVDD 2024 Challenge CtrSVDD track☆23Updated 2 months ago
- Implementation of the paper: Channel-wise Gated Res2Net: Towards Robust Detection of Synthetic Speech Attacks (INTERSPEECH 2021)☆30Updated 3 years ago
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆22Updated last year
- ☆22Updated 6 months ago
- Code for paper "Audio Deepfake Detection with Self-supervised XLS-R and SLS classifier☆19Updated 4 months ago
- TDY-CNN for text-independent speaker verification☆17Updated 2 years ago