xjchenGit/awesome-audio-visual-deepfake

awesome-audio-visual-robustness

☆11

Alternatives and similar repositories for awesome-audio-visual-deepfake

Users who are interested in awesome-audio-visual-deepfake are comparing it to the repositories listed below.

rst0070 / Rawformer-implementation-anti-spoofing
PyTorch implementation of "LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION"
☆39 · Jul 24, 2023 · Updated 3 years ago
ErosRos / conformer-based-classifier-for-anti-spoofing
Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.
☆32 · Nov 7, 2023 · Updated 2 years ago
isjwdu / DFADD
Official implementation and dataset for the paper "DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset"
☆16 · Apr 7, 2025 · Updated last year
ControlNet / AV-Deepfake1M
[ACM MM Award] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset
☆188 · Mar 27, 2026 · Updated 4 months ago
yzyouzhang / Audio_Research_in_US
Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…
☆27 · Feb 27, 2026 · Updated 5 months ago
xjchenGit / SingGraph
Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).
☆24 · Sep 19, 2025 · Updated 10 months ago
zds-potato / multilingual-phonetic-sv
☆10 · Dec 22, 2023 · Updated 2 years ago
ymhzyj / UMMAFormer
[ACM MM'23] UMMAFormer: A Universal Multimodal-adaptive Transformer Framework For Temporal Forgery Localization
☆79 · Nov 12, 2024 · Updated last year
YapengTian / AV-Robustness-CVPR21
Can audio-visual integration strengthen robustness under multimodal attacks?
☆30 · Mar 31, 2022 · Updated 4 years ago
ammarahhashmi / Multimodal-Forgery-Detection-Using-Ensemble-Learning
This repository contains the official implementation (PyTorch) of "Multimodal Forgery Detection Using Ensemble Learning" proposed in APSI…
☆10 · Jan 4, 2023 · Updated 3 years ago
coder-backend / OCR-and-language-translation
Completed!!!
☆11 · Oct 2, 2021 · Updated 4 years ago
dunky11 / exponential-weighting-watermarking
Implementation of "Robust Watermarking of Neural Network with Exponential Weighting" in TensorFlow.
☆13 · Dec 2, 2020 · Updated 5 years ago
ACallglad / Digital_Audio_Watermarking-
Digital audio watermarker that can encode and extract secret messages from sound files. Written using MATLAB as well as implemented a pyt…
☆12 · Oct 5, 2021 · Updated 4 years ago
zhangyong0538 / QChatGPT
A chat system based on the GPT language model, implemented in Qt C++, with support for input and output text-processing plugins
☆13 · Jan 15, 2026 · Updated 6 months ago
ControlNet / LAV-DF
[CVIU, DICTA Award] Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization
☆115 · Jun 24, 2025 · Updated last year
ZichenMiao / CL_Atom_Swapping
ICLR 2022 (Spotlight): Continual Learning With Filter Atom Swapping
☆16 · Jul 5, 2023 · Updated 3 years ago
JoeLeelyf / OpenAVFF
An unofficial PyTorch implementation of the closed-source, newly published work AVFF: Audio-Visual Feature Fusion for Video Deepfake Detec…
☆67 · Aug 26, 2024 · Updated last year
Vincent-ZHQ / MRDF
Code for Cross-Modality and Within-Modality Regularization for Audio-Visual DeepFake Detection
☆43 · Apr 6, 2024 · Updated 2 years ago
xjchenGit / MTDVocaLiST
Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).
☆29 · Apr 3, 2024 · Updated 2 years ago
cyaaronk / audio_deepfake_eval
☆24 · Sep 11, 2025 · Updated 10 months ago
Liu-Tianchi / Golden-Gemini-for-Speaker-Verification
Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'
☆15 · Jan 20, 2025 · Updated last year
Yaselley / SSL_Layerwise_Deepfake
SSL Layerwise analysis for speech deepfake detection
☆36 · Aug 5, 2025 · Updated 11 months ago
Carlofkl / HIT2021ComputerNetwork
Harbin Institute of Technology (HIT) Computer Networks course, Fall 2021
☆13 · Mar 30, 2023 · Updated 3 years ago
ItzJuny / CFPRF
[ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization
☆41 · Dec 20, 2024 · Updated last year
tarun360 / SpeakerProfiling
Estimating the age, height, and gender of a speaker from their speech signal.
☆15 · Sep 19, 2022 · Updated 3 years ago
cfeng16 / audio-visual-forensics
[CVPR 2023 (Highlight)] Self-Supervised Video Forensics by Audio-Visual Anomaly Detection
☆113 · May 12, 2024 · Updated 2 years ago
xieyuankun / FSD-Dataset
This repository presents the FSD dataset for song deepfake detection.
☆24 · Aug 18, 2025 · Updated 11 months ago
riya-joshi-401 / Ethereum-Blockchain-based-Electronic-Voting-System
A decentralized electronic voting system using blockchain which helps users to cast their votes using the web portal in an efficient and …
☆18 · May 21, 2022 · Updated 4 years ago
VicaYang / Robust-Digital-Watermark
Implements some trivial digital watermarks and tests their robustness
☆17 · Apr 12, 2019 · Updated 7 years ago
Liu-Tianchi / Nes2Net
☆111 · Apr 4, 2026 · Updated 3 months ago
xuquanfeng / qrpca
☆12 · Nov 18, 2022 · Updated 3 years ago
ductuantruong / enskd
[ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
☆16 · Mar 20, 2024 · Updated 2 years ago
ActivityForensics / activityforensics
[CVPR 2026] ActivityForensics: A Comprehensive Benchmark for Localizing Manipulated Activity in Videos
☆23 · Apr 27, 2026 · Updated 3 months ago
Liu-Tianchi / Nes2Net_ASVspoof_ITW
☆60 · Apr 4, 2026 · Updated 3 months ago
roger-tseng / CodecFake
A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024
☆22 · Jul 27, 2024 · Updated 2 years ago
TaoRuijie / MFV-KSD
Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)
☆22 · Jul 25, 2024 · Updated 2 years ago
OmerBenHayun / STALL
[CVPR 2026] "Training-free Detection of Generated Videos via Spatial-Temporal Likelihoods"
☆20 · Jun 24, 2026 · Updated last month
TakHemlata / RawBoost-antispoofing
This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…
☆78 · Sep 24, 2023 · Updated 2 years ago
ucas-hao / qwen_audio_for_add
[ACM MM 2025] Officially released code for ALLM4ADD
☆44 · Oct 30, 2025 · Updated 8 months ago