my-yy / vfal_papers
Voice Face Association Learning Paper List
☆15 Updated last year
Alternatives and similar repositories for vfal_papers:
Users who are interested in vfal_papers are comparing it to the repositories listed below.
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing' ☆40 Updated 2 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024) ☆18 Updated 9 months ago
- ☆18 Updated last year
- ☆13 Updated 9 months ago
- ☆27 Updated last year
- A Compact and Effective Pretrained Model for Speech Emotion Recognition ☆38 Updated 9 months ago
- PyTorch implementation of RawNeXt: Speaker verification system for variable-duration utterance with deep layer aggregation and dynamic sc… ☆25 Updated 2 years ago
- ☆13 Updated 2 years ago
- ☆46 Updated 9 months ago
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect… ☆37 Updated 4 months ago
- Official implementation of the ICASSP 2024 paper: Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verificat… ☆16 Updated last year
- PyTorch implementation of Diff-SV: A Unified Hierarchical Framework for Noise-Robust Speaker Verification Using Score-Based Diffusion Pro… ☆23 Updated last year
- ICASSP 2022: 'Self-supervised Speaker Recognition with Loss-gated Learning' ☆89 Updated last year
- Official implementation of SpeechFormer, written in Python (PyTorch). ☆78 Updated 2 years ago
- Framework for training and evaluating self-supervised learning methods for speaker verification. ☆22 Updated 2 months ago
- [ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization ☆23 Updated 4 months ago
- 3-D Convolutional Recurrent Neural Networks With Attention Model for Speech Emotion Recognition. ☆39 Updated 4 years ago
- This is the PyTorch implementation of our work titled "An Efficient Temporary Deepfake Location Approach Based Embeddings for Partially S… ☆18 Updated 5 months ago
- This is the official repo of our work titled "The Codecfake Dataset and Countermeasures for the Universally Detection of Deepfake Audio". ☆55 Updated 4 months ago
- ☆17 Updated last year
- ☆33 Updated 5 months ago
- ☆41 Updated 4 years ago
- SpeechFormer++ in PyTorch ☆48 Updated last year
- Learning Domain-Invariant Transformation for Speaker Verification. ☆11 Updated last year
- ☆23 Updated 9 months ago
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea… ☆60 Updated last year
- SASV2 baseline, a track on ASVspoof5 phase2 challenge ☆23 Updated 9 months ago
- ☆48 Updated 7 months ago
- An attention-based backend allowing efficient fine-tuning of transformer models for speaker verification ☆20 Updated 7 months ago
- Time-domain synthetic speech detection net (TSSDNet), having the classic ResNet and Inception Net style structures (Res-TSSDNet and Inc-T… ☆68 Updated 3 years ago