YapengTian/AV-Robustness-CVPR21

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/YapengTian/AV-Robustness-CVPR21)

YapengTian / AV-Robustness-CVPR21

Can audio-visual integration strengthen robustness under multimodal attacks?

☆30

Alternatives and similar repositories for AV-Robustness-CVPR21

Users that are interested in AV-Robustness-CVPR21 are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

xjchenGit / awesome-audio-visual-deepfake
View on GitHub
awesome-audio-visual-robustness
☆11Jan 27, 2024Updated 2 years ago
shincling / discreteSeparation
View on GitHub
The demo for "Discretization and Re-synthesis: an alternative method to solve the Cocktail Party Problem".
☆12Oct 25, 2021Updated 4 years ago
MetaVQA / MetaVQA
View on GitHub
Implementation of MetaVQA.
☆12Jul 3, 2021Updated 5 years ago
vdean / audio-curiosity
View on GitHub
☆22Nov 17, 2020Updated 5 years ago
Wangtk311 / SafeEar-Inference-Test-Script
View on GitHub
SafeEar是由浙大和清华共同开发的一种深度伪声探测模型。这是我撰写的模型推理脚本。我不确定它是否正确，目前我还是初学者，如有问题请原谅我并指出，谢谢！
☆16May 16, 2025Updated last year
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
ArrayDPS / ArrayDPS
View on GitHub
☆40May 12, 2025Updated last year
martinmamql / multimodal_routing
View on GitHub
☆20Oct 23, 2022Updated 3 years ago
unbiarirang / Fixed-Input-Parameterization
View on GitHub
This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"
☆32Sep 13, 2024Updated last year
keven980716 / weak-to-strong-deception
View on GitHub
[ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"
☆15Jun 21, 2024Updated 2 years ago
MaHuanAAA / MoNIG
View on GitHub
Code for paper Trustworthy Multimodal Regression with Mixture of Normal-inverse Gamma Distributions.
☆51Nov 3, 2023Updated 2 years ago
hammlab / PoisoningCertifiedDefenses
View on GitHub
How Robust are Randomized Smoothing based Defenses to Data Poisoning? (CVPR 2021)
☆14Jul 16, 2021Updated 5 years ago
JusperLee / speechbrain-docs-zh-cn
View on GitHub
SpeechBrain中文文档
☆12Mar 20, 2021Updated 5 years ago
desh2608 / css
View on GitHub
PyTorch implementation of Continuous Speech Separation
☆12Oct 5, 2022Updated 3 years ago
OpenNLPLab / TAVGBench
View on GitHub
Demo page of TAVGBench: Benchmarking Text to Audible-Video Generation
☆15Apr 7, 2025Updated last year
Managed Kubernetes at scale on DigitalOcean • Ad
DigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
jczhang02 / MUSIC_dataset_script
View on GitHub
This repo contains script to download MUSIC dataset from youtube
☆12Jan 19, 2024Updated 2 years ago
lcn-kul / xls-r-analysis-sqa
View on GitHub
Analysis of XLS-R for Speech Quality Assessment
☆15Feb 10, 2025Updated last year
facebookresearch / learning-audio-visual-dereverberation
View on GitHub
Code for paper Learning Audio-Visual Dereverberation
☆32Aug 10, 2022Updated 3 years ago
afourast / avobjects
View on GitHub
Implementation for ECCV20 paper "Self-Supervised Learning of audio-visual objects from video"
☆114Nov 16, 2020Updated 5 years ago
weiguoPian / AV-CIL_ICCV2023
View on GitHub
[ICCV 2023] Audio-Visual Class-Incremental Learning
☆35Sep 29, 2024Updated last year
stoneMo / MGN
View on GitHub
Official implementation for MGN
☆20Dec 22, 2022Updated 3 years ago
pliang279 / factorized
View on GitHub
[ICLR 2019] Learning Factorized Multimodal Representations
☆69Aug 4, 2020Updated 5 years ago
Ding-Kexin / IF_CALC
View on GitHub
The repository contains the implementations for Coupled Adversarial Learning for Fusion Classification of Hyperspectral and LiDAR Data
☆21Jul 15, 2024Updated 2 years ago
GeWu-Lab / APPO
View on GitHub
The official repository for CVPR'26 Paper "APPO: Attention-guided Perception Policy Optimization for Video Reasoning"
☆17Mar 19, 2026Updated 4 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
roger-tseng / av-superb
View on GitHub
A Multi-Task Evaluation Benchmark for Audio-Visual Representation Models (ICASSP 2024)
☆58Apr 17, 2024Updated 2 years ago
jordipons / AudioSetOntologyTree
View on GitHub
Tree visualization of the AudioSet Ontology - https://github.com/audioset/ontology
☆18Aug 8, 2024Updated last year
liuruoyu / cross-meida-evaluation
View on GitHub
Evaluation cross-media retrieval using a new protocol.
☆11Mar 14, 2017Updated 9 years ago
Vekteur / probabilistic-calibration-study
View on GitHub
Implementation of "A Large-Scale Study of Probabilistic Calibration in Neural Network Regression" (ICML 2023)
☆11Oct 7, 2025Updated 9 months ago
EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
yaohungt / Pointwise_Dependency_Neural_Estimation
View on GitHub
☆20Jun 16, 2020Updated 6 years ago
Wenjun-Peng / GPT4SM
View on GitHub
☆11Jun 7, 2023Updated 3 years ago
GeWu-Lab / Certifiable-Robust-Multi-modal-Training
View on GitHub
A python implement for Certifiable Robust Multi-modal Training
☆20Jun 21, 2025Updated last year
dmhyun / MSRP
View on GitHub
Official repository of Generating Multiple-Length Summaries via Reinforcement Learning for Unsupervised Sentence Summarization [EMNLP'22 …
☆10May 20, 2023Updated 3 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Reagan1311 / Mask2IV
View on GitHub
Mask2IV: Interaction-Centric Video Generation via Mask Trajectories (AAAI 2026)
☆17Jun 8, 2026Updated last month
aam-at / adversary_critic
View on GitHub
☆13Jun 24, 2020Updated 6 years ago
AnthonySong98 / awesome-graph-based-semi-supervised-learning
View on GitHub
A collection of resources for graph-based semi-supervised learning (GSSL).
☆20Aug 30, 2021Updated 4 years ago
FingerRec / awesome_video_self_supervised
View on GitHub
awesome video-based self-supervised learning methods in recently years
☆10Nov 26, 2020Updated 5 years ago
Bizilizi / VGGSounder
View on GitHub
VGGSounder, a multi-label audio-visual classification dataset with modality annotations.
☆17Jun 30, 2026Updated 3 weeks ago
brightjade / PRiSM
View on GitHub
Source code for paper "PRiSM: Enhancing Low-Resource Document-Level Relation Extraction with Relation-Aware Score Calibration", Findings …
☆11Jun 20, 2025Updated last year
LUMIA-Group / Leveraging-Self-Supervised-Learning-for-AVSR
View on GitHub
Official PyTorch implementation of paper Leveraging Unimodal Self Supervised Learning for Multimodal Audio-Visual Speech Recognition (ACL…
☆67Jul 13, 2022Updated 4 years ago