jzhang538 / BadMerging
[CCS 2024] "BadMerging: Backdoor Attacks Against Model Merging": official code implementation.
☆27Updated 8 months ago
Alternatives and similar repositories for BadMerging:
Users that are interested in BadMerging are comparing it to the libraries listed below
- [ICLR 2024] Inducing High Energy-Latency of Large Vision-Language Models with Verbose Images☆33Updated last year
- (CVPR 2025) Official implementation to DELT: A Simple Diversity-driven EarlyLate Training for Dataset Distillation which outperforms SOTA…☆20Updated last month
- [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling D…☆25Updated 11 months ago
- A Simple Baseline Achieving Over 90% Success Rate Against the Strong Black-box Models of GPT-4.5/4o/o1. Paper at: https://arxiv.org/abs/2…☆55Updated last week
- [ICML 2024] Unsupervised Adversarial Fine-Tuning of Vision Embeddings for Robust Large Vision-Language Models☆128Updated 5 months ago
- Improving Your Model Ranking on Chatbot Arena by Vote Rigging☆20Updated 2 months ago
- ☆27Updated last year
- Official implementation of NeurIPS'24 paper "Defensive Unlearning with Adversarial Training for Robust Concept Erasure in Diffusion Model…☆40Updated 5 months ago
- Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"☆46Updated 3 months ago
- ☆27Updated this week
- [ICLR 2025] Dissecting Adversarial Robustness of Multimodal LM Agents☆80Updated 2 months ago
- The official implementation of ECCV'24 paper "To Generate or Not? Safety-Driven Unlearned Diffusion Models Are Still Easy To Generate Uns…☆73Updated last month
- [ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …☆28Updated 6 months ago
- This is the official code for the paper "Virus: Harmful Fine-tuning Attack for Large Language Models Bypassing Guardrail Moderation"☆46Updated 2 months ago
- Unsolvable Problem Detection: Evaluating Trustworthiness of Vision Language Models☆75Updated 7 months ago
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…☆57Updated 9 months ago
- [ECCV'24 Oral] The official GitHub page for ''Images are Achilles' Heel of Alignment: Exploiting Visual Vulnerabilities for Jailbreaking …☆19Updated 6 months ago
- 🔥🔥🔥Breaking long thought processes of o1-like LLMs, such as DeepSeek-R1, QwQ☆28Updated last month
- [ICLR 2025] VideoShield: Regulating Diffusion-based Video Generation Models via Watermarking (Official Implementation)☆34Updated 2 weeks ago
- List of T2I safety papers, updated daily, welcome to discuss using Discussions☆61Updated 8 months ago
- [ICLR24 (Spotlight)] "SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation…☆120Updated 5 months ago
- Official implementation of the paper "Neural Honeytrace: A Robust Plug-and-Play Watermarking Framework against Model Extraction Attacks"☆18Updated 3 months ago
- Official repo of Exploring the Adversarial Vulnerabilities of Vision-Language-Action Models in Robotics☆23Updated last month
- This repository contains the source code, datasets, and scripts for the paper "GenderCARE: A Comprehensive Framework for Assessing and Re…☆21Updated 7 months ago
- [NeurIPS 2024] Official implementation of the paper “Ferrari: Federated Feature Unlearning via Optimizing Feature Sensitivity"☆15Updated last month
- PDM-based Purifier☆20Updated 5 months ago
- AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models☆53Updated last year
- ☆24Updated last month
- [ICML 2024] Prompting4Debugging: Red-Teaming Text-to-Image Diffusion Models by Finding Problematic Prompts (Official Pytorch Implementati…☆41Updated 4 months ago
- ☆20Updated 2 weeks ago