SaFoLab-WISC / FIUBench

A Task of Fictitious Unlearning for VLMs

☆24

Alternatives and similar repositories for FIUBench

Users who are interested in FIUBench are comparing it to the repositories listed below.

UCSC-VLAA / vllm-safety-benchmark
[ECCV 2024] Official PyTorch Implementation of "How Many Unicorns Are in This Image? A Safety Evaluation Benchmark for Vision LLMs"
☆83 Updated last year
ys-zong / VLGuard
[ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.
☆78 Updated 10 months ago
gyhdog99 / ECSO
ECSO (Make MLLMs safe with neither training nor any external models!) (https://arxiv.org/abs/2403.09572)
☆33 Updated last year
xirui-li / MOSSBench
An implementation for MLLM oversensitivity evaluation
☆16 Updated last year
MartinPawelczyk / In-Context-Unlearning
"In-Context Unlearning: Language Models as Few Shot Unlearners". Martin Pawelczyk, Seth Neel* and Himabindu Lakkaraju*; ICML 2024.
☆28 Updated 2 years ago
fangjf1 / OpenSafeMLRM
The first toolkit for MLRM safety evaluation, providing a unified interface for mainstream models, datasets, and jailbreaking methods!
☆14 Updated 7 months ago
DripNowhy / ETA
[ICLR 2025] PyTorch Implementation of "ETA: Evaluating Then Aligning Safety of Vision Language Models at Inference Time"
☆28 Updated 4 months ago
franciscoliu / MLLMU-Bench
[NAACL 2025 Main] Official Implementation of MLLMU-Bench
☆43 Updated 8 months ago
pipilurj / MLLM-protector
The official repository for paper "MLLM-Protector: Ensuring MLLM’s Safety without Hurting Performance"
☆44 Updated last year
swj0419 / muse_bench
☆30 Updated 8 months ago
YitingQu / unsafe-diffusion
☆38 Updated last year
alenai97 / PEFT-MLLM
Official code and data for the ACL 2024 Findings paper "An Empirical Study on Parameter-Efficient Fine-Tuning for MultiModal Large Language Models"
☆23 Updated last year
sail-sg / AnyDoor
AnyDoor: Test-Time Backdoor Attacks on Multimodal Large Language Models
☆60 Updated last year
chuhac / Reasoning-to-Defend
[EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
☆11 Updated 2 months ago
itsvaibhav01 / Immune
[CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment
☆25 Updated 5 months ago
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆96 Updated last year
ybwang119 / Awesome-reasoning-safety
This repo is for the safety topic, including attacks, defenses and studies related to reasoning and RL
☆52 Updated 2 months ago
ChengshuaiZhao0 / The-Wolf-Within
☆12 Updated 4 months ago
jinzhuoran / RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆86 Updated last year
SaFoLab-WISC / AdaShield
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…
☆66 Updated last year
inspire-group / tta_risk
☆14 Updated 2 years ago
JTWang2000 / FreeShap
Fine-tuning-free Shapley value (FreeShap) for instance attribution
☆14 Updated last year
umd-huang-lab / VLM-Poisoning
Code for the NeurIPS 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"
☆56 Updated 10 months ago
YukeHu / vlm_mia
Code for paper "Membership Inference Attacks Against Vision-Language Models"
☆20 Updated 9 months ago
erfanshayegani / Jailbreak-In-Pieces
[ICLR 2024 Spotlight 🔥 ] - [ Best Paper Award SoCal NLP 2023 🏆] - Jailbreak in pieces: Compositional Adversarial Attacks on Multi-Modal…
☆75 Updated last year
IBM / SafeLoRA
GitHub repo for the NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"
☆22 Updated 2 months ago
KID-22 / LLM-Unlearning-Paper-List
☆28 Updated last year
OPTML-Group / SOUL
Official repo for EMNLP'24 paper "SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning"
☆28 Updated last year
shengliu66 / VTI
Code for Reducing Hallucinations in Vision-Language Models via Latent Space Steering
☆88 Updated 11 months ago
WangCheng0116 / Awesome-LRMs-Safety
Official repository for "Safety in Large Reasoning Models: A Survey" - Exploring safety risks, attacks, and defenses for Large Reasoning …
☆80 Updated 2 months ago