SafeArena is a benchmark for assessing the harmful capabilities of web agents
☆21Apr 23, 2025Updated 10 months ago
Alternatives and similar repositories for safearena
Users that are interested in safearena are comparing it to the libraries listed below
Sorting:
- Synthetic Data Generation for Evaluation☆13Feb 21, 2025Updated last year
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 6 months ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆40Aug 7, 2025Updated 7 months ago
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Nov 2, 2023Updated 2 years ago
- A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios☆19Updated this week
- Jupyter notebook templates for processing and analyzing neuroscience data.☆14Updated this week
- ☆13Feb 4, 2025Updated last year
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns☆13Mar 1, 2025Updated last year
- Fully automatic skin lesion segmentation using the Berkeley wavelet transform and UNet algorithm.☆12Jun 1, 2021Updated 4 years ago
- Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School☆12Oct 12, 2024Updated last year
- heterogeneous graph attention network for SMEs bankruptcy prediction☆12Feb 26, 2021Updated 5 years ago
- We enable LLM with personalization capability☆11Nov 16, 2023Updated 2 years ago
- ☆10Aug 22, 2022Updated 3 years ago
- A small framework for benchmarking machine learning models.☆21Jun 6, 2025Updated 9 months ago
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- ☆11Feb 28, 2024Updated 2 years ago
- ICML2025: One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework☆14Jun 24, 2025Updated 8 months ago
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆14Nov 17, 2023Updated 2 years ago
- ☆20Jan 5, 2026Updated 2 months ago
- enchmarking Large Language Models' Resistance to Malicious Code☆14Dec 1, 2024Updated last year
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- ☆11Mar 5, 2025Updated last year
- ☆14Sep 4, 2024Updated last year
- customized dash NGL viewer☆12Jan 6, 2023Updated 3 years ago
- Manish-GenAI / Deep-Learning-Based-Approach-to-Anomaly-Detection-Techniques-for-Large-Acoustic-Data-Deep-Learning-Based Approach to Anomaly Detection Techniques for Large Acoustic Data in Machine Operation.Developed a deep leaning algor…☆18Jun 6, 2025Updated 9 months ago
- Package for Computational Biology Reading Group☆13Apr 20, 2022Updated 3 years ago
- Scorpius: Poisoning scientific knowledge using large language models☆11Aug 3, 2024Updated last year
- Curated list of awesome ML Visualization Libraries☆13Jun 23, 2023Updated 2 years ago
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- Face Interaction Graph Networks: A GNN-based rigid body physics simulator☆19Sep 30, 2025Updated 5 months ago
- [COLING 2025🔥] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection☆17Jan 21, 2025Updated last year
- A collection of functions to help you easily train and run Tensorflow Keras. It includes 1-line auto-TPU support, GPU memory management, …☆12Jul 6, 2022Updated 3 years ago
- Aurora is a central design system for all products and applications for the Open, Accessible Digital Workspace. This repo is for all code…☆16Feb 23, 2024Updated 2 years ago
- An enterprise deep research benchmark☆33Updated this week
- ☆31Feb 6, 2026Updated last month
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆65Oct 28, 2025Updated 4 months ago
- Cross-GPU KV Cache Marketplace☆23Nov 12, 2025Updated 3 months ago
- Accept by CVPR 2025 (highlight)☆22Jun 8, 2025Updated 9 months ago