SafeArena is a benchmark for assessing the harmful capabilities of web agents
☆22Apr 23, 2025Updated last year
Alternatives and similar repositories for safearena
Users that are interested in safearena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Synthetic Data Generation for Evaluation☆14Feb 21, 2025Updated last year
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 9 months ago
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Nov 2, 2023Updated 2 years ago
- Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"☆613Oct 7, 2025Updated 7 months ago
- Evaluating GPT-OSS on BrowseComp-Plus with Native Browsering Tools☆20Oct 17, 2025Updated 7 months ago
- Proton VPN Special Offer - Get 70% off • AdSpecial partner offer. Trusted by over 100 million users worldwide. Tested, Approved and Recommended by Experts.
- virtual node analysis on ogb benchmark dataset☆14Mar 9, 2023Updated 3 years ago
- 🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.☆13Mar 16, 2023Updated 3 years ago
- Visual Verb Sense Disambiguation☆13Apr 26, 2019Updated 7 years ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆11Feb 15, 2024Updated 2 years ago
- This is the repository of the Dense Hierarchical Retrieval for Open-Domain Question Answering☆14Dec 23, 2021Updated 4 years ago
- ☆11Feb 28, 2024Updated 2 years ago
- ☆10Aug 22, 2022Updated 3 years ago
- ☆15Oct 24, 2022Updated 3 years ago
- A Benchmark for Evaluating Safety and Trustworthiness in Web Agents for Enterprise Scenarios☆24Mar 12, 2026Updated 2 months ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- PyTorch reimplementation of REALM and ORQA☆22Feb 3, 2022Updated 4 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆87Aug 12, 2024Updated last year
- A basic, free, ad-less, PWA-ready, open-source QR Code generator☆18Aug 12, 2025Updated 9 months ago
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns☆14Mar 1, 2025Updated last year
- ☆18Dec 20, 2025Updated 5 months ago
- Implementation of the Mask R-CNN model using OCaml's numerical library Owl.☆19Jan 30, 2020Updated 6 years ago
- customized dash NGL viewer☆12Jan 6, 2023Updated 3 years ago
- ☆25Mar 3, 2026Updated 2 months ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- Tools for managing BibTeX bibliographies: automatically update preprints to published versions and filter to only cited references.☆90May 19, 2026Updated last week
- Curated list of awesome ML Visualization Libraries☆14Jun 23, 2023Updated 2 years ago
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"☆21Mar 22, 2024Updated 2 years ago
- AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM☆87Nov 3, 2024Updated last year
- ☆16Nov 18, 2024Updated last year
- ☆24Apr 29, 2026Updated last month
- code of paper "Defending Against Alignment-Breaking Attacks via Robustly Aligned LLM"☆14Nov 17, 2023Updated 2 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Official implementation of Vector-ICL: In-context Learning with Continuous Vector Representations (ICLR 2025)☆23Jun 2, 2025Updated 11 months ago
- Code for "On Measuring Faithfulness of Natural Language Explanations"☆22Jul 23, 2024Updated last year
- [NDSS'25] The official implementation of safety misalignment.☆19Jan 8, 2025Updated last year
- AgentLab: An open-source framework for developing, testing, and benchmarking web agents on diverse tasks, designed for scalability and re…☆585Mar 17, 2026Updated 2 months ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆139Apr 30, 2024Updated 2 years ago
- Common repo and documentation space for DataMeet Pune chapter☆17Jun 7, 2019Updated 6 years ago
- Unofficial LaTex templates for thesis and IEEE conference at National Taiwan University. 國立臺灣大學電機資訊學院碩博士論文及 IEEE conference 模板☆32Feb 9, 2025Updated last year