yjyddq / RiOSWorldLinks

[NeurIPS 2025] Official repository of RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents

☆47

Alternatives and similar repositories for RiOSWorld

Users that are interested in RiOSWorld are comparing it to the libraries listed below

Sorting:

AI45Lab / VLSBench
[ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety
☆53Updated 4 months ago
AI45Lab / X-Boundary
The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Compromising Usa…
☆37Updated this week
ydyjya / LLM-IHS-Explanation
☆55Updated last year
WangCheng0116 / Awesome-LRMs-Safety
Official repository for "Safety in Large Reasoning Models: A Survey" - Exploring safety risks, attacks, and defenses for Large Reasoning …
☆80Updated 3 months ago
wonderNefelibata / Awesome-LRM-Safety
Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as …
☆78Updated this week
itsqyh / Awesome-LMMs-Mechanistic-Interpretability
A curated collection of resources focused on the Mechanistic Interpretability (MI) of Large Multimodal Models (LMMs). This repository agg…
☆164Updated last month
isXinLiu / MM-SafetyBench
Accepted by ECCV 2024
☆177Updated last year
isXinLiu / Awesome-MLLM-Safety
Accepted by IJCAI-24 Survey Track
☆223Updated last year
EchoseChen / SPA-VL-RLHF
The reinforcement learning codes for dataset SPA-VL
☆42Updated last year
TrustGen / TrustEval-toolkit
Toolkit for evaluating the trustworthiness of generative foundation models.
☆123Updated 3 months ago
eric-ai-lab / MSSBench
[ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"
☆30Updated 5 months ago
QingyangZhang / Label-Free-RLVR
☆289Updated 4 months ago
salman-lui / x-teaming
☆47Updated 6 months ago
CryptoAILab / FigStep
[AAAI'25 (Oral)] Jailbreaking Large Vision-language Models via Typographic Visual Prompts
☆179Updated 5 months ago
XiaoYee / Awesome_Efficient_LRM_Reasoning
😎 A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, Agent, and Beyond
☆317Updated last month
OpenSafetyLab / SALAD-BENCH
【ACL 2024】 SALAD benchmark & MD-Judge
☆166Updated 8 months ago
ChnQ / MI-Peaks
☆55Updated 4 months ago
EIT-NLP / Awesome-Latent-CoT
This repository contains a regularly updated paper list for LLMs-reasoning-in-latent-space.
☆194Updated 2 weeks ago
Unispac / shallow-vs-deep-alignment
Official Repository for The Paper: Safety Alignment Should Be Made More Than Just a Few Tokens Deep
☆165Updated 7 months ago
Joshua-Ren / Learning_dynamics_LLM
☆184Updated 6 months ago
StarDewXXX / Awesome-Hybrid-CoT-Reasoning
☆56Updated 5 months ago
AI45Lab / ActorAttack
☆111Updated 9 months ago
Blueyee / Efficient-CoT-LRMs
Chain of Thoughts (CoT) is so hot! so long! We need short reasoning process!
☆70Updated 7 months ago
SaFoLab-WISC / AdaShield
[ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…
☆68Updated last year
thu-ml / MLA-Trust
A toolbox for benchmarking Multimodal LLM Agents trustworthiness across truthfulness, controllability, safety and privacy dimensions thro…
☆57Updated 5 months ago
OSU-NLP-Group / AgentSafety
☆130Updated 3 weeks ago
jianghoucheng / AlphaEdit
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models, ICLR 2025 (Outstanding Paper)
☆373Updated last month
AI4Good24 / PsySafe
☆50Updated 9 months ago
liudaizong / Awesome-LVLM-Attack
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
☆434Updated 2 weeks ago
Jihuai-wpy / InferAligner
☆37Updated last year