msaadsaeed / SBNetLinks
Official implementation of SBNet as described in "Single-branch Network for Multimodal Training".
☆11Updated 2 years ago
Alternatives and similar repositories for SBNet
Users that are interested in SBNet are comparing it to the libraries listed below
Sorting:
- This repository contains the speaker labeled information of VoxCeleb2 and LRS3 audio-visual datasets. (AAAI 2025)☆11Updated 11 months ago
- Implementation of Attack Agnostic Dataset: Towards Generalization and Stabilization of Audio DeepFake Detection paper☆61Updated 2 years ago
- Official implementation of RAVEn (ICLR 2023) and BRAVEn (ICASSP 2024)☆68Updated 6 months ago
- Voice Face Association Learning Paper List☆16Updated 2 years ago
- PyTorch implementation of "Lip to Speech Synthesis with Visual Context Attentional GAN" (NeurIPS2021)☆25Updated last year
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆24Updated last year
- Baselines for IS25 Source Tracing Special Session☆29Updated 7 months ago
- Official implementation of the Odyssey paper "A Probabilistic Fusion Framework for Spoofing Aware Speaker Verification"☆18Updated 3 years ago
- [T-IFS'24] Audio Multi-view Spoofing Detection Framework Based on Audio-Text-Emotion Correlations☆25Updated last year
- [ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization☆31Updated 8 months ago
- Implementation of "SpecRNet: Towards Faster and More Accessible Audio DeepFake Detection" paper☆35Updated 2 years ago
- ☆16Updated last year
- (SLT 2024) Learning Video Temporal Dynamics with Cross-Modal Attention for Robust Audio-Visual Speech Recognition☆12Updated 10 months ago
- ☆46Updated 2 years ago
- ☆23Updated last year
- ICASSP 2023: 'Speaker recognition with two-step multi-modal deep cleansing'☆44Updated 2 years ago
- ☆90Updated 3 years ago
- Self-Supervised Speech/Sound Pre-training and Representation Learning Toolkit☆13Updated 2 years ago
- ☆19Updated 2 years ago
- deep-learning based audio-visual lip bometrics☆14Updated 2 years ago
- Official implementation of the INTERSPEECH 2024 paper: Temporal-Channel Modeling in Multi-head Self-Attention for Synthetic Speech Detect…☆44Updated 8 months ago
- SSL Layerwise analysis for speech deepfake detection☆23Updated 3 weeks ago
- Region-Based Optimization in Continual Learning for Audio Deepfake Detection☆10Updated 8 months ago
- Source code and speech samples for the DSU-AVO paper accepted to INTERSPEECH 2023☆12Updated last year
- Official implementation of FOP method as described in "Fusion and Orthogonal Projection for Improved Face-Voice Association"☆19Updated last year
- ☆27Updated last year
- ☆23Updated last year
- Dynamic vision-guided speaker embedding for audio-visual speaker diarization☆11Updated 3 years ago
- Implementation of "Defense against Adversarial Attacks on Audio DeepFake Detection"☆54Updated last year
- Continual Learning Method RAWM for ICML 2023☆23Updated 11 months ago