awesome-audio-visual-robustness
☆11Jan 27, 2024Updated 2 years ago
Alternatives and similar repositories for awesome-audio-visual-deepfake
Users that are interested in awesome-audio-visual-deepfake are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Implementation of "A conformer-based classifier for variable-length utterance processing in anti-spoofing" published in Interspeech 2023.☆31Nov 7, 2023Updated 2 years ago
- Pytorch implementation of "LEVERAGING POSITIONAL-RELATED LOCAL-GLOBAL DEPENDENCY FOR SYNTHETIC SPEECH DETECTION"☆38Jul 24, 2023Updated 2 years ago
- [ACM MM Award] AV-Deepfake1M: A Large-Scale LLM-Driven Audio-Visual Deepfake Dataset☆183Mar 27, 2026Updated last month
- Official Implementation and Dataset of paper - DFADD: The Diffusion and Flow-matching based Audio Deepfake Dataset☆15Apr 7, 2025Updated last year
- [ACMMM2025] Official released code for ALLM4ADD☆40Oct 30, 2025Updated 6 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Audio Research in US. US-based professors who work on audio (music, speech, acoustics). For students who would like to apply for RA, PhD,…☆27Feb 27, 2026Updated 2 months ago
- ☆10Dec 22, 2023Updated 2 years ago
- Official repository for the paper Singing Voice Graph Modeling for SingFake Detection (Interspeech 2024).☆25Sep 19, 2025Updated 7 months ago
- [ACM MM'23] UMMAFormer: A Universal Multimodal-adaptive Transformer Framework For Temporal Forgery Localization☆79Nov 12, 2024Updated last year
- ☆56Apr 4, 2026Updated last month
- SSL Layerwise analysis for speech deepfake detection☆34Aug 5, 2025Updated 9 months ago
- Can audio-visual integration strengthen robustness under multimodal attacks?☆29Mar 31, 2022Updated 4 years ago
- This repository contains the official implementation (PyTorch) of "Multimodal Forgery Detection Using Ensemble Learning" proposed in APSI…☆10Jan 4, 2023Updated 3 years ago
- Completed!!!☆11Oct 2, 2021Updated 4 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Implemention of "Robust Watermarking of Neural Network with Exponential Weighting" in TensorFlow.☆13Dec 2, 2020Updated 5 years ago
- [ACM MM'24] Coarse-to-Fine Proposal Refinement Framework for Audio Temporal Forgery Detection and Localization☆41Dec 20, 2024Updated last year
- Generative Regional Editing (GRE) Benchmark☆19Sep 10, 2024Updated last year
- A deepfake audio dataset for detecting fake speech from codec-based speech synthesis systems, Interspeech 2024☆20Jul 27, 2024Updated last year
- [CVIU, DICTA Award] Glitch in the Matrix: A Large Scale Benchmark for Content Driven Audio-Visual Forgery Detection and Localization☆110Jun 24, 2025Updated 10 months ago
- Digital audio watermarker that can encode and extract secret messages from sound files. Written using MATLAB as well as implemented a pyt…☆12Oct 5, 2021Updated 4 years ago
- An unofficial pytorch implementation of the closed-source newly published work AVFF: Audio-Visual Feature Fusion for Video Deepfake Detec…☆64Aug 26, 2024Updated last year
- Qt C++实现的基于GPT 语言模型的聊天系统,支持输入输出文本处理插件☆13Jan 15, 2026Updated 3 months ago
- ICLR 2022 (Spolight): Continual Learning With Filter Atom Swapping☆16Jul 5, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Official release of pretrained models and codes for 'Golden Gemini Is All You Need: Finding the Sweet Spots for Speaker Verification'☆15Jan 20, 2025Updated last year
- Official repository for the paper Multimodal Transformer Distillation for Audio-Visual Synchronization (ICASSP 2024).☆29Apr 3, 2024Updated 2 years ago
- ☆31Jun 19, 2025Updated 10 months ago
- Estimating the Age, Height, and Gender of a speaker with their speech signal.☆14Sep 19, 2022Updated 3 years ago
- [CVPR 2023 (Highlight)] Self-Supervised Video Forensics by Audio-Visual Anomaly Detection☆110May 12, 2024Updated last year
- ☆24Sep 11, 2025Updated 7 months ago
- 哈工大2021秋计算机网络☆13Mar 30, 2023Updated 3 years ago
- This repository presents FSD dataset for song deepfake detection.☆25Aug 18, 2025Updated 8 months ago
- A decentralized electronic voting system using blockchain which helps users to cast their votes using the web portal in an efficient and …☆18May 21, 2022Updated 3 years ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- SpeechFake: A Large-Scale Multilingual Speech Deepfake Dataset Incorporating Cutting-Edge Generation Methods☆27Aug 13, 2025Updated 8 months ago
- [ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification☆16Mar 20, 2024Updated 2 years ago
- Implement some trivial digital watermark and test its robustness☆17Apr 12, 2019Updated 7 years ago
- Multi-Stage Face-Voice Association Learning with Keynote Speaker Diarization (ACM MM 2024)☆22Jul 25, 2024Updated last year
- This repository includes the code to reproduce our paper "RawBoost: A Raw Data Boosting and Augmentation Method applied to Automatic Spea…☆76Sep 24, 2023Updated 2 years ago
- ☆24Feb 3, 2026Updated 3 months ago
- A small C OpenCL wrapper☆17Apr 18, 2017Updated 9 years ago