McGill-NLP / safearenaView external linksLinks
SafeArena is a benchmark for assessing the harmful capabilities of web agents
☆21Apr 23, 2025Updated 9 months ago
Alternatives and similar repositories for safearena
Users that are interested in safearena are comparing it to the libraries listed below
Sorting:
- Synthetic Data Generation for Evaluation☆13Feb 21, 2025Updated 11 months ago
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 5 months ago
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Nov 2, 2023Updated 2 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆21Jan 6, 2026Updated last month
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns☆13Mar 1, 2025Updated 11 months ago
- Jupyter notebook templates for processing and analyzing neuroscience data.☆13Dec 28, 2025Updated last month
- ☆13Feb 4, 2025Updated last year
- ☆12Oct 4, 2021Updated 4 years ago
- Fully automatic skin lesion segmentation using the Berkeley wavelet transform and UNet algorithm.☆12Jun 1, 2021Updated 4 years ago
- We enable LLM with personalization capability☆11Nov 16, 2023Updated 2 years ago
- ☆10Aug 22, 2022Updated 3 years ago
- A small framework for benchmarking machine learning models.☆21Jun 6, 2025Updated 8 months ago
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆58Oct 28, 2025Updated 3 months ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- ICML2025: One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework☆14Jun 24, 2025Updated 7 months ago
- heterogeneous graph attention network for SMEs bankruptcy prediction☆12Feb 26, 2021Updated 4 years ago
- Skin lesion classification, using Keras and the ISIC 2020 dataset