thu-ml / MLA-Trust
A toolbox for benchmarking the trustworthiness of Multimodal LLM Agents across the truthfulness, controllability, safety, and privacy dimensions through 34 interactive tasks
☆63 · Updated Jan 9, 2026
Alternatives and similar repositories for MLA-Trust
Users interested in MLA-Trust are comparing it to the repositories listed below.
- [ICML 2025] X-Transfer Attacks: Towards Super Transferable Adversarial Attacks on CLIP ☆37 · Updated Feb 3, 2026
- On the Robustness of GUI Grounding Models Against Image Attacks ☆12 · Updated Apr 8, 2025
- [NDSS'25] The official implementation of safety misalignment. ☆17 · Updated Jan 8, 2025
- [CVPR 2024 Highlight] Strong Transferable Adversarial Attacks via Ensembled Asymptotically Normal Distribution Learning ☆19 · Updated Jun 14, 2024
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types ☆24 · Updated Nov 29, 2024
- ☆36 · Updated Feb 2, 2026
- ☆115 · Updated Jul 2, 2024
- ☆21 · Updated Mar 17, 2025
- Enterprise AI Security Platform - Real-time firewall protection for LLM applications against prompt injection, data leakage, and function… ☆23 · Updated Sep 14, 2025
- ☆34 · Updated Jul 12, 2024
- [NeurIPS 2025] StegoZip: Enhancing Linguistic Steganography Payload in Practice with Large Language Models ☆24 · Updated Dec 4, 2025
- ☆11 · Updated Dec 23, 2024
- Official repository for ReasonGen-R1 ☆74 · Updated Jun 23, 2025
- ☆14 · Updated Aug 7, 2025
- ☆12 · Updated May 6, 2022
- This repo covers the safety topic, including attacks, defenses, and studies related to reasoning and RL ☆59 · Updated Sep 5, 2025
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT