SafeArena is a benchmark for assessing the harmful capabilities of web agents
☆21Apr 23, 2025Updated 11 months ago
Alternatives and similar repositories for safearena
Users that are interested in safearena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 8 months ago
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Nov 2, 2023Updated 2 years ago
- AgentRewardBench: Evaluating Automatic Evaluations of Web Agent Trajectories☆42Aug 7, 2025Updated 8 months ago
- Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling☆16Jan 8, 2022Updated 4 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆24Jan 6, 2026Updated 3 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ICLR 2025] No Preference Left Behind: Group Distributional Preference Optimization☆15Apr 21, 2025Updated 11 months ago
- Understanding the interplay between memorization and generalization in neural networks, featuring MAT, a learning algorithm to enhance ro…☆40Dec 19, 2024Updated last year
- Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools☆19Oct 17, 2025Updated 6 months ago
- Visual Verb Sense Disambiguation☆13Apr 26, 2019Updated 6 years ago
- This repository contains the dataset and code for our ACL'23 publication: "MatSci-NLP: Evaluating Scientific Language Models on Materials…☆17Nov 21, 2023Updated 2 years ago
- The official Genbench Collaborative Benchmarking Task repository 2023 (Archived)☆14Jul 23, 2024Updated last year
- CMake is an open-source, cross-platform family of tools designed to build, test and package software. This repo contains dockerfile for C…☆12Jun 18, 2020Updated 5 years ago
- ACL 2020 papers by authors who are members of underrepresented groups (URMs)☆16Jul 10, 2020Updated 5 years ago
- This is the repository of the Dense Hierarchical Retrieval for Open-Domain Question Answering☆14Dec 23, 2021Updated 4 years ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- ☆11Feb 28, 2024Updated 2 years ago
- EMNLP 2020: On the Ability and Limitations of Transformers to Recognize Formal Languages☆24Oct 10, 2020Updated 5 years ago
- ☆10Aug 22, 2022Updated 3 years ago
- A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios☆21Mar 12, 2026Updated last month
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆87Aug 12, 2024Updated last year
- Code for WACV 2023 paper "VLC-BERT: Visual Question Answering with Contextualized Commonsense Knowledge"☆21May 8, 2023Updated 2 years ago
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns☆14Mar 1, 2025Updated last year
- Data splits for the NAACL 2016 paper☆22Mar 17, 2016Updated 10 years ago
- Deploy open-source AI quickly and easily - Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- Tools for managing BibTeX bibliographies: automatically update preprints to published versions and filter to only cited references.☆83Feb 22, 2026Updated last month
- Curated list of awesome ML Visualization Libraries☆13Jun 23, 2023Updated 2 years ago
- SeeGULL is a broad-coverage stereotype dataset in English containing stereotypes about identity groups spanning 178 countries across 8 di…☆38Sep 25, 2023Updated 2 years ago
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"☆21Mar 22, 2024Updated 2 years ago
- ☆18Feb 17, 2025Updated last year
- ☆16Nov 18, 2024Updated last year
- An enterprise deep research benchmark☆35Apr 8, 2026Updated last week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ICML2025: One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework☆14Jun 24, 2025Updated 9 months ago
- [WSDM'2025] "MixRec: Heterogeneous Graph Collaborative Filtering"☆20Dec 19, 2024Updated last year
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆14Nov 17, 2023Updated 2 years ago
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆23Jun 2, 2025Updated 10 months ago
- [NDSS'25] The official implementation of safety misalignment.☆18Jan 8, 2025Updated last year
- ☆23Jan 5, 2026Updated 3 months ago
- Common repo and documentation space for DataMeet Pune chapter☆17Jun 7, 2019Updated 6 years ago