Jielin-Qiu / MMWatermark-Robustness
Evaluating Durability: Benchmark Insights into Multimodal Watermarking
☆10Updated last year
Alternatives and similar repositories for MMWatermark-Robustness
Users who are interested in MMWatermark-Robustness are comparing it to the libraries listed below
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- [DMLR 2024] Benchmarking Robustness of Multimodal Image-Text Models under Distribution Shift☆37Updated last year
- [ICLR2025 Spotlight] Advantage-Guided Distillation for Preference Alignment in Small Language Models☆21Updated 5 months ago
- Code for paper "Out-of-Domain Robustness via Targeted Augmentations"☆13Updated 2 years ago
- ☆14Updated 4 months ago
- [ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"☆81Updated last year
- ☆27Updated last year
- Code and data for "ImgTrojan: Jailbreaking Vision-Language Models with ONE Image"☆24Updated 3 months ago
- Official Implementation of Avoiding spurious correlations via logit correction☆17Updated 2 years ago
- Code for paper "SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents"☆43Updated 4 months ago
- ☆44Updated 2 years ago
- ☆26Updated 3 months ago
- [ICML 2023] "Robust Weight Signatures: Gaining Robustness as Easy as Patching Weights?" by Ruisi Cai, Zhenyu Zhang, Zhangyang Wang☆16Updated 2 years ago
- What Makes a Reward Model a Good Teacher? An Optimization Perspective☆34Updated 3 weeks ago
- [ICML 24] A novel automated neuron explanation framework that can accurately describe poly-semantic concepts in deep neural networks☆13Updated 2 months ago
- The code for our NeurIPS 2021 paper "Kernelized Heterogeneous Risk Minimization".☆13Updated 3 years ago
- Representation Surgery for Multi-Task Model Merging. ICML, 2024.☆45Updated 9 months ago
- ☆25Updated 5 months ago
- Code for NeurIPS 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"☆50Updated 6 months ago
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆13Updated last year
- Attack AlphaZero Go agents (NeurIPS 2022)☆21Updated 2 years ago
- Official repo of Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics☆31Updated 3 months ago
- [NeurIPS 2023] Official repository for "Distilling Out-of-Distribution Robustness from Vision-Language Foundation Models"☆12Updated last year
- Official PyTorch implementation of "CleanCLIP: Mitigating Data Poisoning Attacks in Multimodal Contrastive Learning" @ ICCV 2023☆36Updated last year
- OODRobustBench: a Benchmark and Large-Scale Analysis of Adversarial Robustness under Distribution Shift. ICML 2024 and ICLRW-DMLR 2024☆22Updated 11 months ago
- The repository of the paper "REEF: Representation Encoding Fingerprints for Large Language Models," aims to protect the IP of open-source…☆57Updated 6 months ago
- Certified Patch Robustness via Smoothed Vision Transformers☆42Updated 3 years ago
- The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"☆37Updated last year
- Repository for research works and resources related to model reprogramming <https://arxiv.org/abs/2202.10629>☆61Updated last year
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆20Updated 3 weeks ago