[ICLR'26 Oral] RedTeamCUA: Realistic Adversarial Testing of Computer-Use Agents in Hybrid Web-OS Environments
☆43 · Feb 9, 2026 · Updated last month
Alternatives and similar repositories for RedTeamCUA
Users that are interested in RedTeamCUA are comparing it to the libraries listed below.
- official code repo for paper "Merging Models on the Fly Without Retraining: A Sequential Approach to Scalable Continual Model Merging" ☆25 · Oct 11, 2025 · Updated 5 months ago
- [NeurIPS 2023] Adaptive Test-Time Personalization for Federated Learning. Wenxuan Bao, Tianxin Wei, Haohan Wang, Jingrui He. ☆25 · Dec 25, 2023 · Updated 2 years ago
- Code for Voice Jailbreak Attacks Against GPT-4o. ☆37 · May 31, 2024 · Updated last year
- [ECCV24] "Challenging Forgets: Unveiling the Worst-Case Forget Sets in Machine Unlearning" by Chongyu Fan*, Jiancheng Liu*, Alfred Hero, … ☆25 · May 27, 2025 · Updated 9 months ago
- Code relative to "Adversarial robustness against multiple and single $l_p$-threat models via quick fine-tuning of robust classifiers" ☆19 · Nov 30, 2022 · Updated 3 years ago
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free ☆55 · Apr 6, 2025 · Updated 11 months ago
- Domain Generation Algorithms research papers, datasets and code ☆15 · May 17, 2020 · Updated 5 years ago
- FairPrep is a design and evaluation framework for fairness-enhancing interventions that treats data as a first-class citizen. ☆11 · Mar 24, 2023 · Updated 3 years ago
- On the effectiveness of adversarial training against common corruptions [UAI 2022] ☆30 · May 16, 2022 · Updated 3 years ago
- Q&A dataset for many-shot jailbreaking ☆14 · Jul 19, 2024 · Updated last year
- ☆49 · Jun 19, 2024 · Updated last year
- Applying SAEs for fine-grained control ☆26 · Dec 15, 2024 · Updated last year
- https://arxiv.org/abs/2102.12594 ☆14 · Oct 3, 2023 · Updated 2 years ago
- ☆37 · Oct 2, 2024 · Updated last year
- The official implementation for paper: Vision-Language Models are Strong Noisy Label Detectors ☆16 · Mar 31, 2025 · Updated 11 months ago
- ☆11 · Mar 24, 2023 · Updated 3 years ago
- ☆38 · Dec 19, 2024 · Updated last year
- [ICLR 2022 official code] Robust Learning Meets Generative Models: Can Proxy Distributions Improve Adversarial Robustness? ☆29 · Mar 15, 2022 · Updated 4 years ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models" ☆11 · Apr 15, 2024 · Updated last year
- ☆36 · May 21, 2025 · Updated 10 months ago
- CNCA: Toward Customizable and Natural Generation of Adversarial Camouflage for Vehicle Detectors ☆15 · Nov 3, 2024 · Updated last year
- Code for "Consistent Estimators for Learning to Defer to an Expert" (ICML 2020) ☆16 · Jan 28, 2023 · Updated 3 years ago
- A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi… ☆47 · Feb 15, 2026 · Updated last month
- (TPAMI 2026) Complementary Text-Guided Attention for Zero-Shot Adversarial Robustness & (NeurIPS 2024) Text-Guided Attention is All Y… ☆16 · Mar 6, 2026 · Updated 2 weeks ago
- WAFFLE: Watermarking in Federated Learning ☆23 · Aug 21, 2023 · Updated 2 years ago
- Code for ICCV2025 paper: IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves ☆17 · Jul 11, 2025 · Updated 8 months ago
- The code for our work ☆18 · Apr 7, 2024 · Updated last year
- Accurate predictions of Fantasy Football scores can make a difference in a Fantasy Football league. The provided files were used to creat… ☆14 · May 26, 2015 · Updated 10 years ago
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks". ☆55 · Updated this week
- ☆15 · Mar 28, 2024 · Updated last year
- A collection of publications that works on code models but beyond focusing on the accuracies. ☆13 · Jun 30, 2023 · Updated 2 years ago
- ☆25 · Sep 3, 2025 · Updated 6 months ago
- White-box Fairness Testing through Adversarial Sampling ☆14 · Apr 16, 2021 · Updated 4 years ago
- ACL24 ☆11 · Jun 7, 2024 · Updated last year
- Implementation of IEEE TNNLS 2023 and Elsevier PR 2023 papers on backdoor watermarking for deep classification models with unambiguity an… ☆19 · Jul 27, 2023 · Updated 2 years ago
- Provable Worst Case Guarantees for the Detection of Out-of-Distribution Data ☆13 · Sep 20, 2022 · Updated 3 years ago
- Website & Documentation: https://sbaresearch.github.io/model-watermarking/ ☆25 · Sep 22, 2023 · Updated 2 years ago
- ☆24 · Apr 14, 2019 · Updated 6 years ago
- [ACL 25] SafeChain: Safety of Language Models with Long Chain-of-Thought Reasoning Capabilities ☆29 · Apr 2, 2025 · Updated 11 months ago