Code, benchmark and environment for "OS-Sentinel: Towards Safety-Enhanced Mobile GUI Agents via Hybrid Validation in Realistic Workflows"
☆39 · Nov 10, 2025 · Updated 4 months ago
Alternatives and similar repositories for OS-Sentinel
Users interested in OS-Sentinel are comparing it to the repositories listed below
- Code for Research Project TLDR ☆25 · Jul 28, 2025 · Updated 7 months ago
- ☆24 · Jun 13, 2023 · Updated 2 years ago
- Code for "From Ideal to Real: Unified and Data-Efficient Dense Prediction for Real-World Scenarios" ☆28 · Jul 7, 2025 · Updated 8 months ago
- An Arena-style Automated Evaluation Benchmark for Detailed Captioning ☆58 · Jun 1, 2025 · Updated 9 months ago
- ☆18 · Nov 3, 2025 · Updated 4 months ago
- ☆75 · Dec 6, 2024 · Updated last year
- ☆48 · Oct 2, 2025 · Updated 5 months ago
- [ACL 2025] AgentStore: Scalable Integration of Heterogeneous Agents As Specialized Generalist Computer Assistant ☆44 · Dec 19, 2024 · Updated last year
- ☆40 · Jan 23, 2024 · Updated 2 years ago
- ☆12 · Aug 8, 2024 · Updated last year
- Retrieved Sequence Augmentation for Protein Representation Learning ☆53 · Nov 1, 2023 · Updated 2 years ago
- Repo for Anonymous purpose, pls don't distribute ☆10 · Oct 2, 2024 · Updated last year
- GlotEval: a unified evaluation toolkit designed to benchmark multilingual Large Language Models (LLMs) in a language-specific way ☆18 · Nov 4, 2025 · Updated 4 months ago
- [NeurIPS 2024 poster] Cross-model Control: Improving Multiple Large Language Models in One-time Training ☆14 · Oct 25, 2024 · Updated last year
- Internal utility libraries for Pkl ☆16 · Mar 10, 2026 · Updated last week
- [AAAI 2025 Oral] MuMA-ToM: Multi-modal Multi-Agent Theory of Mind ☆39 · Jan 23, 2025 · Updated last year
- ☆30 · Jan 15, 2026 · Updated 2 months ago
- PyTorch implementation for PaLM: A Hybrid Parser and Language Model. ☆10 · Jan 7, 2020 · Updated 6 years ago
- Official codebase for “In-Context Learning with Many Demonstration Examples” ☆16 · Feb 13, 2023 · Updated 3 years ago
- An official implementation of Random Policy Valuation is Enough for LLM Reasoning with Verifiable Rewards ☆37 · Oct 3, 2025 · Updated 5 months ago
- Implementation of the paper "Improving the Accuracy-Robustness Trade-off of Classifiers via Adaptive Smoothing". ☆10 · Feb 6, 2024 · Updated 2 years ago
- Rethinking the Trust Region in LLM Reinforcement Learning ☆45 · Mar 2, 2026 · Updated 2 weeks ago
- ☆21 · Dec 3, 2025 · Updated 3 months ago
- ☆10 · Dec 21, 2019 · Updated 6 years ago
- A Large-Scale Open-Domain Tabular Question Answering Dataset for the Real Estate Sector ☆14 · Jun 26, 2025 · Updated 8 months ago
- Better Transition-Based AMR Parsing with a Refined Search Space (authors' DyNet implementation for the EMNLP18 paper) ☆10 · Jun 13, 2019 · Updated 6 years ago
- Official Implementation for *PaCo-RL: Advancing Reinforcement Learning for Consistent Image Generation with Pairwise Reward Modeling* ☆32 · Dec 13, 2025 · Updated 3 months ago
- The official repository of the first version of ACE-Brain foundation model. ☆62 · Mar 13, 2026 · Updated last week
- ICCV 2021 papers and code focus on adversarial attacks and defense ☆11 · Nov 5, 2021 · Updated 4 years ago
- QGEval: A Benchmark for Question Generation Evaluation ☆19 · Nov 7, 2024 · Updated last year
- Motion Generation from Fine-grained Textual Descriptions (LREC-COLING 2024) ☆15 · Jun 13, 2024 · Updated last year
- Dependency Grammar Induction ☆18 · Feb 11, 2019 · Updated 7 years ago
- The training codes of Jasper-Token-Compression-600M ☆19 · Nov 19, 2025 · Updated 4 months ago
- content-neutral dataset of logical reasoning ☆20 · Mar 21, 2025 · Updated last year
- A Self-Training Framework for Vision-Language Reasoning ☆88 · Jan 23, 2025 · Updated last year
- ☆12 · Sep 22, 2023 · Updated 2 years ago
- ☆32 · Oct 23, 2025 · Updated 4 months ago
- On the Robustness of GUI Grounding Models Against Image Attacks ☆12 · Apr 8, 2025 · Updated 11 months ago
- [ICML 2025] EffiCoder: Enhancing Code Generation in Large Language Models through Efficiency-Aware Fine-tuning ☆16 · May 24, 2025 · Updated 9 months ago