[NeurIPS 2025] Official repository of RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
☆117Dec 2, 2025Updated 5 months ago
Alternatives and similar repositories for RiOSWorld
Users that are interested in RiOSWorld are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- A simple anime statitics tracker that helps you explore seasonal anime from 2006 onwards with a lot interesting data and visualization.☆43Apr 26, 2026Updated last week
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆60Jul 21, 2025Updated 9 months ago
- ☆130Feb 3, 2025Updated last year
- Diagnostic Framework for LLMs and MLLMs☆36Mar 2, 2026Updated 2 months ago
- ☆67Jul 14, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Code for ICCV2025 paper——IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves☆17Jul 11, 2025Updated 9 months ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆33Jun 23, 2025Updated 10 months ago
- A composable CLI project generator and build helper for GoFiber applications.☆43Updated this week
- [EMNLP 2025] The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Com…☆40Nov 24, 2025Updated 5 months ago
- Java decompilation & deobfuscation lab - dockerized toolset☆17Apr 15, 2026Updated 3 weeks ago
- The officalimplement of dLLM-Factory☆25Jul 12, 2025Updated 9 months ago
- [NeurIPS 2023] "Diversified Outlier Exposure for Out-of-Distribution Detection via Informative Extrapolation"☆11Oct 6, 2023Updated 2 years ago
- [AAAI 2026] Data and Code for Paper IS-Bench: Evaluating Interactive Safety of VLM-Driven Embodied Agents in Daily Household Tasks☆43Nov 24, 2025Updated 5 months ago
- A virtual clinical environment for self‑evolving LLM diagnostic agents.☆103Feb 12, 2026Updated 2 months ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆55Mar 25, 2026Updated last month
- ccap `(C)amera(CAP)ture` is a simple and easy-to-use C/C++ camera capture library designed to provide you with simple and efficient camer…☆162Updated this week
- ☆221Oct 12, 2025Updated 6 months ago
- This is a simple implementation for crypto research agent inspired by Claude Skills repo☆106Jan 28, 2026Updated 3 months ago
- team Doggeee's solution to Ego4D LTA challenge@CVPRW23'☆14Nov 4, 2023Updated 2 years ago
- ☆36Oct 22, 2025Updated 6 months ago
- [ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆18Apr 3, 2025Updated last year
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 9 months ago
- This is the official implementation of the method presented in the paper "Uncertainty-Aware Test-Time Optimization for 3D Human Pose Esti…☆36Apr 9, 2026Updated 3 weeks ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- [NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"☆13Oct 28, 2024Updated last year
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆25Nov 29, 2024Updated last year
- Satori botnet variant☆13Mar 19, 2022Updated 4 years ago
- Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"☆100Apr 23, 2026Updated last week
- WraAct is a tool to construct the convex hull of various activation functions.☆33Feb 13, 2026Updated 2 months ago
- ☆99Jan 28, 2026Updated 3 months ago
- Official reposity for paper "High-Dimension Human Value Representation in Large Language Models" (NAACL'25 Main)☆23Jul 9, 2024Updated last year
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- 《MobileUse: A Hierarchical Reflection-Driven GUI Agent for Autonomous Mobile Operation》☆142Feb 2, 2026Updated 3 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆16Sep 17, 2024Updated last year
- Secure Inference Resilient Against Malicious Clients☆14May 3, 2022Updated 4 years ago
- ☆32Jan 28, 2026Updated 3 months ago
- Shadow Attack, LiRA, Quantile Regression and RMIA implementations in PyTorch (Online version)☆14Nov 8, 2024Updated last year
- [ACL 2025] Research code for the paper "OS-Kairos: Adaptive Interaction for MLLM-Powered GUI Agents"☆21Jun 19, 2025Updated 10 months ago
- Official code repo for the paper "MemGUI-Bench: Benchmarking Memory of Mobile GUI Agents in Dynamic Environments"☆41Apr 28, 2026Updated last week
- Code for the paper: "Recursive Self-Attention Modules-Based Network for Panchromatic and Multispectral Image Fusion", JSTARS 2023.☆12Apr 18, 2024Updated 2 years ago