McGill-NLP/safearena

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/McGill-NLP/safearena)

McGill-NLP / safearena

SafeArena is a benchmark for assessing the harmful capabilities of web agents

☆24

Alternatives and similar repositories for safearena

Users that are interested in safearena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

McGill-NLP / AdversarialTriggers
View on GitHub
TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models
☆19Aug 17, 2025Updated 11 months ago
ServiceNow / GroundCUA
View on GitHub
GroundCUA
☆132Mar 24, 2026Updated 4 months ago
BangLab-UdeM-Mila / NLP4MatSci-ACL23
View on GitHub
This repository contains the dataset and code for our ACL'23 publication: "MatSci-NLP: Evaluating Scientific Language Models on Materials…
☆17Nov 21, 2023Updated 2 years ago
thu-coai / Agent-SafetyBench
View on GitHub
☆151Aug 11, 2025Updated 11 months ago
sinahmr / LocAtViT
View on GitHub
PyTorch Implementation of LocAtViT in "Locality-Attending Vision Transformer" (ICLR 2026)
☆19Mar 10, 2026Updated 4 months ago
AI Agents on DigitalOcean Gradient AI Platform • Ad
Build production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
ServiceNow / drbench
View on GitHub
An enterprise deep research benchmark
☆40Apr 22, 2026Updated 3 months ago
Tele-EVOL / TeleAI-Safety
View on GitHub
☆27Jan 5, 2026Updated 6 months ago
TeamPigeonLab / CS-DJ
View on GitHub
Accept by CVPR 2025 (highlight)
☆25Jun 8, 2025Updated last year
xhluca / material-ui-in-pyodide
View on GitHub
☆10Aug 22, 2022Updated 3 years ago
WadeYin9712 / GeoMLAMA
View on GitHub
☆15Oct 24, 2022Updated 3 years ago
ServiceNow / AgentLab
View on GitHub
AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…
☆610Jul 17, 2026Updated last week
McGill-NLP / feedbackqa
View on GitHub
FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback
☆12Jul 13, 2022Updated 4 years ago
Hannibal046 / GPT-OSS-BrowseCompPlus-Eval
View on GitHub
Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools
☆20Oct 17, 2025Updated 9 months ago
fstrub95 / torch.github.io
View on GitHub
Torch's web page.
☆12Mar 9, 2016Updated 10 years ago
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
lamalab-org / chembench-paper
View on GitHub
☆25Jan 22, 2025Updated last year
nlpub / hyperstar
View on GitHub
Hyperstar: Negative Sampling Improves Hypernymy Extraction Based on Projection Learning.
☆24Jan 2, 2020Updated 6 years ago
qqaatw / pytorch-realm-orqa
View on GitHub
PyTorch reimplementation of REALM and ORQA
☆22Feb 3, 2022Updated 4 years ago
McGill-NLP / instruct-qa
View on GitHub
Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"
☆87Aug 12, 2024Updated last year
Heidelberg-NLP / CC-SHAP
View on GitHub
Code for "On Measuring Faithfulness of Natural Language Explanations"
☆23Jul 14, 2026Updated 2 weeks ago
ServiceNow / DoomArena
View on GitHub
DoomArena is a Framework for Testing AI Agents Against Evolving Security Threats
☆62Sep 12, 2025Updated 10 months ago
yjwtheonly / Scorpius
View on GitHub
Scorpius: Poisoning scientific knowledge using large language models
☆11Aug 3, 2024Updated last year
aditya10 / VLC-BERT
View on GitHub
Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"
☆21May 8, 2023Updated 3 years ago
scalable-model-editing / unified-model-editing
View on GitHub
We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.
☆29Dec 16, 2024Updated last year
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
Aaquib111 / edge-attribution-patching
View on GitHub
Code for my NeurIPS 2024 ATTRIB paper titled "Attribution Patching Outperforms Automated Circuit Discovery"
☆48May 31, 2024Updated 2 years ago
wahyuhadi / semgrep-server-rules
View on GitHub
☆18Dec 20, 2025Updated 7 months ago
ZJULiHongxin / UIPro
View on GitHub
Advanced GUI agents
☆16Feb 3, 2026Updated 5 months ago
xhluca / awesome-ml-visualization
View on GitHub
Curated list of awesome ML Visualization Libraries
☆15Jun 23, 2023Updated 3 years ago
google-research-datasets / seegull
View on GitHub
SeeGULL is a broad-coverage stereotype dataset in English containing stereotypes about identity groups spanning 178 countries across 8 di…
☆38Sep 25, 2023Updated 2 years ago
xhluca / bm25-benchmarks
View on GitHub
☆24Jul 10, 2026Updated 2 weeks ago
SproutNan / AI-Safety_Benchmark
View on GitHub
The official repository for guided jailbreak benchmark
☆31Jul 28, 2025Updated last year
carriex / lfqa_eval
View on GitHub
ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"
☆21Mar 22, 2024Updated 2 years ago
Zhang-Yihao / Adversarial-Representation-Engineering
View on GitHub
Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.
☆20Dec 6, 2024Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
NY1024 / BAP-Jailbreak-Vision-Language-Models-via-Bi-Modal-Adversarial-Prompt
View on GitHub
☆61Jun 5, 2024Updated 2 years ago
HKUDS / MixRec
View on GitHub
[WSDM'2025] "MixRec: Heterogeneous Graph Collaborative Filtering"
☆20Dec 19, 2024Updated last year
mmarius / montreal-things-to-do
View on GitHub
A list of things to do in Montréal.
☆28Oct 6, 2025Updated 9 months ago
EvanZhuang / vector-icl
View on GitHub
Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)
☆24Jun 2, 2025Updated last year
danielemalitesta / Multimodal-DL-4-RecSys
View on GitHub
Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School
☆12Oct 12, 2024Updated last year
OSU-NLP-Group / AmpleGCG
View on GitHub
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM
☆87Nov 3, 2024Updated last year
WukLab / osworld-human
View on GitHub
OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents
☆27May 17, 2026Updated 2 months ago