jylee425/mobilesafetybench

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/jylee425/mobilesafetybench)

jylee425 / mobilesafetybench

Evaluating Safety of Autonomous Agents in Mobile Device Control (AAAI 2026 AI Alignment Track)

☆34

Alternatives and similar repositories for mobilesafetybench

Users that are interested in mobilesafetybench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

minnesotanlp / infoVerse
View on GitHub
Jaehyung Kim et al's ACL 2023 paper on "infoVerse: A Universal Framework for Dataset Characterization with Multidimensional Meta-informat…
☆16Jun 28, 2023Updated 3 years ago
choi403 / DiffusionGuard
View on GitHub
DiffusionGuard: A Robust Defense Against Malicious Diffusion-based Image Editing (ICLR 2025)
☆47May 18, 2025Updated last year
csmile-1006 / DEAS-Isaac-GR00T
View on GitHub
DEAS + Isaac-GR00T + RoboCasa
☆20Nov 22, 2025Updated 8 months ago
csmile-1006 / ARP
View on GitHub
Guide Your Agent with Adaptive Multimodal Rewards (NeurIPS 2023 Accepted)
☆33Sep 25, 2023Updated 2 years ago
jihoontack / GradNCP
View on GitHub
Learning Large-scale Neural Fields via Context Pruned Meta-Learning (NeurIPS 2023)
☆28Sep 24, 2023Updated 2 years ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
sihyun-yu / RoMA
View on GitHub
[NeurIPS'21] RoMA: Robust Model Adaptation for Offline Model-based Optimization
☆15Oct 28, 2021Updated 4 years ago
albert-y1n / PISmith
View on GitHub
PISmith: Reinforcement Learning-based Red Teaming for Prompt Injection Defenses
☆22Jul 17, 2026Updated last week
huiwon-jang / RSP
View on GitHub
Visual Representation Learning with Stochastic Frame Prediction (ICML 2024)
☆28Nov 27, 2024Updated last year
younggyoseo / MV-MWM
View on GitHub
☆61Apr 16, 2023Updated 3 years ago
junsu-kim97 / PIG
View on GitHub
PyTorch code accompanying the paper "Imitating Graph-Based Planning with Goal-Conditioned Policies" (ICLR 2023).
☆21Mar 4, 2023Updated 3 years ago
StarWalkin / UI-NEXUS
View on GitHub
This is the official repository of the paper "Atomic-to-Compositional Generalization for Mobile Agents with A New Benchmark and Schedulin…
☆14Jul 27, 2025Updated 11 months ago
jihoontack / SiMT
View on GitHub
Meta-Learning with Self-Improving Momentum Target (NeurIPS 2022)
☆23Oct 12, 2022Updated 3 years ago
bbuing9 / DND
View on GitHub
Code for the paper "What Makes Better Augmentation Strategies? Augment Difficult but Not too Different" (ICLR 22)
☆12Aug 28, 2023Updated 2 years ago
alinlab / MetaMAE
View on GitHub
Modality-Agnostic Self-Supervised Learning with Meta-Learned Masked Auto-Encoder (NeurIPS 2023)
☆10Jun 5, 2024Updated 2 years ago
Virtual machines for every use case on DigitalOcean • Ad
Get dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
choi403 / ALG
View on GitHub
Improving Motion in Image-to-Video Models via Adaptive Low-Pass Guidance (CVPR 2026 Highlight)
☆59Feb 23, 2026Updated 5 months ago
sungsoo-ahn / learning_what_to_defer
View on GitHub
☆24Dec 4, 2020Updated 5 years ago
hyunseoklee-ai / ReMoDetect
View on GitHub
ReMoDetect: Reward Models Recognize Aligned LLM's Generations (NeurIPS 2024)
☆17Nov 15, 2024Updated last year
jaehyun513 / P2T
View on GitHub
Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024).
☆13Aug 6, 2024Updated last year
Zsbyqx20 / AgentHazard
View on GitHub
Mobile GUI Agents under Real-world Threats: Are We There Yet?
☆17May 18, 2026Updated 2 months ago
subin-kim-cv / CSD
View on GitHub
Collaborative Score Distillation for Consistent Visual Synthesis (NeurIPS 2023)
☆120Nov 21, 2023Updated 2 years ago
hankook / CLEL
View on GitHub
☆17Mar 2, 2023Updated 3 years ago
younggyoseo / trajectory_mcl
View on GitHub
Trajectory-wise Multiple Choice Learning for Dynamics Generalization in Reinforcement Learning (NeurIPS 2020)
☆39Oct 27, 2020Updated 5 years ago
kingdy2002 / VCSE
View on GitHub
☆18Jun 8, 2023Updated 3 years ago
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
bbuing9 / ICLR24_SuRe
View on GitHub
Official Code for the paper "SuRe: Summarizing Retrievals using Answer Candidates for Open-domain QA of LLMs" (ICLR 2024)
☆27May 7, 2024Updated 2 years ago
EternityJune25 / MVISU-Bench
View on GitHub
[ACM MM 2025 🔥 Oral] MVISU-Bench: Benchmarking Mobile Agents for Real-World Tasks by Multi-App, Vague, Interactive, Single-App and Uneth…
☆15Mar 13, 2026Updated 4 months ago
ZZZhr-1 / Robust_GUI_Grounding
View on GitHub
On the Robustness of GUI Grounding Models Against Image Attacks
☆12Apr 8, 2025Updated last year
younggyoseo / apv
View on GitHub
☆72Jun 20, 2022Updated 4 years ago
chang-github-00 / Predictive-Decoding
View on GitHub
Repo for Anonymous purpose, pls don't distribute
☆10Oct 2, 2024Updated last year
hkust-nlp / GUIMid
View on GitHub
☆22May 3, 2025Updated last year
alinlab / consistency-adversarial
View on GitHub
Consistency Regularization for Adversarial Robustness (AAAI 2022)
☆53Dec 12, 2021Updated 4 years ago
andyzoujm / breaking-llama-guard
View on GitHub
Code to break Llama Guard
☆32Dec 7, 2023Updated 2 years ago
thu-ml / MLA-Trust
View on GitHub
A toolbox for benchmarking Multimodal LLM Agents trustworthiness across truthfulness, controllability, safety and privacy dimensions thro…
☆63Jan 9, 2026Updated 6 months ago
Deploy on Railway without the complexity - Free Credits Offer • Ad
Connect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
jihoontack / MAC
View on GitHub
Online Adaptation of Language Models with a Memory of Amortized Contexts (NeurIPS 2024)
☆77Aug 3, 2024Updated last year
OS-Copilot / OS-Sentinel
View on GitHub
[ACL 2026] Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic…
☆49Jul 5, 2026Updated 2 weeks ago
bor0 / dafny-tutorial
View on GitHub
Exercises for the Dafny Tutorial
☆14May 21, 2018Updated 8 years ago
pokaxpoka / B_Pref
View on GitHub
☆54Nov 10, 2022Updated 3 years ago
showlab / macosworld
View on GitHub
☆35Jan 28, 2026Updated 5 months ago
OSU-NLP-Group / AgentAttack
View on GitHub
☆22Oct 25, 2024Updated last year
OSU-NLP-Group / SeeActChromeExtension
View on GitHub
☆18Jan 3, 2025Updated last year