[NeurIPS 2025] Official repository of RiOSWorld: Benchmarking the Risk of Multimodal Computer-Use Agents
☆118Dec 2, 2025Updated 5 months ago
Alternatives and similar repositories for RiOSWorld
Users that are interested in RiOSWorld are comparing it to the libraries listed below.
- Official Repository of "Taming Masked Diffusion Language Models via Consistency Trajectory Reinforcement Learning with Fewer Decoding Ste…☆28Mar 9, 2026Updated 2 months ago
- A simple anime statistics tracker that helps you explore seasonal anime from 2006 onwards with a lot of interesting data and visualizations.☆42May 17, 2026Updated last week
- Code for the paper "SafeAgentBench: A Benchmark for Safe Task Planning of Embodied LLM Agents"☆70Feb 25, 2025Updated last year
- [ACL 2025] Data and Code for Paper VLSBench: Unveiling Visual Leakage in Multimodal Safety☆61Jul 21, 2025Updated 10 months ago
- Official repository of DARE: Diffusion Large Language Models Alignment and Reinforcement Executor☆198Apr 28, 2026Updated 3 weeks ago
- Diagnostic Framework for LLMs and MLLMs☆38Mar 2, 2026Updated 2 months ago
- ☆67Jul 14, 2025Updated 10 months ago
- Code for the ICCV 2025 paper "IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves"☆17Jul 11, 2025Updated 10 months ago
- [ECCV 2024] The official code for "AdaShield: Safeguarding Multimodal Large Language Models from Structure-based Attack via Adaptive Shi…☆73Feb 9, 2026Updated 3 months ago
- [ICLR 2025] Official codebase for the ICLR 2025 paper "Multimodal Situational Safety"☆35Jun 23, 2025Updated 11 months ago
- [EMNLP 2025] The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Com…☆40Nov 24, 2025Updated 6 months ago
- [ICML 2024] Code for the paper "MoE-RBench: Towards Building Reliable Language Models with Sparse Mixture-of-Experts"☆10Jul 1, 2024Updated last year
- Let AI completely take over your blog☆31Nov 2, 2025Updated 6 months ago
- A virtual clinical environment for self‑evolving LLM diagnostic agents.☆104Feb 12, 2026Updated 3 months ago
- Official Implementation of "ToolSafe: Enhancing Tool Invocation Safety of LLM-based Agents via Proactive Step-level Guardrail and Feedbac…☆58Mar 25, 2026Updated 2 months ago
- ccap `(C)amera(CAP)ture` is a simple and easy-to-use C/C++ camera capture library designed to provide you with simple and efficient camer…☆182May 18, 2026Updated last week
- A simple implementation of a crypto research agent inspired by the Claude Skills repo☆106Jan 28, 2026Updated 3 months ago
- Team Doggeee's solution to the Ego4D LTA challenge at CVPRW'23☆14Nov 4, 2023Updated 2 years ago
- [ICLR 2025] This repo is the official implementation of "The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs".☆13Jan 25, 2025Updated last year
- ☆36Oct 22, 2025Updated 7 months ago
- [ACL 2025] "CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought"☆17Apr 3, 2025Updated last year
- ☆125Feb 6, 2026Updated 3 months ago
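- An all-in-one metaphysics and divination workstation for Windows, including Zi Wei Dou Shu, BaZi, astrology, Liu Ren, Dun Jia, Tai Yi, Liu Yao, Tong She Fa, feng shui, most mainstream predictive techniques (including the orthodox primary directions method), built-in AI analysis, and more. Built as an improvement on the older Xingque Horosa; please do not forget the contributions of Brother Shuang and Brother Zheng.☆107Updated this week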
- This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…☆14Jul 27, 2025Updated 10 months ago
- [NeurIPS 2024] "Self-Calibrated Tuning of Vision-Language Models for Out-of-Distribution Detection"☆13Oct 28, 2024Updated last year
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆25Nov 29, 2024Updated last year
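- A calculator for a mechanical design course project that computes geometric, force, and kinematic parameter values for the electric motor, transmission, V-belt pulleys, gears, shafts, and bearings.☆19Jan 5, 2023Updated 3 years ago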
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- ☆253Feb 19, 2026Updated 3 months ago
- Repo for paper "Rethinking Generalization in Reasoning SFT: A Conditional Analysis on Optimization, Data, and Model Capability"☆106Apr 23, 2026Updated last month
- A RoboMaster radar display☆15Mar 14, 2022Updated 4 years ago
- Llemma formal2formal (tactic prediction) theorem proving experiments☆20Oct 17, 2023Updated 2 years ago
- WraAct is a tool to construct the convex hull of various activation functions.☆33May 18, 2026Updated last week
- Official repository for the paper "High-Dimension Human Value Representation in Large Language Models" (NAACL'25 Main)☆23Jul 9, 2024Updated last year
- ☆100Jan 28, 2026Updated 3 months ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing".☆10Feb 6, 2024Updated 2 years ago
- ☆106Feb 4, 2024Updated 2 years ago
- [Preprint 2025] Causal Prompt Calibration Guided Segment Anything Model for Open-Vocabulary Multi-Entity Segmentation☆38Oct 16, 2025Updated 7 months ago
- MobileUse: an open-source mobile GUI agent for Android phone automation, AndroidWorld/AndroidLab evaluation, hierarchical reflection, and…☆148May 7, 2026Updated 2 weeks ago