SafeArena is a benchmark for assessing the harmful capabilities of web agents
☆22Apr 23, 2025Updated last year
Alternatives and similar repositories for safearena
Users that are interested in safearena are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Code for "Can Retriever-Augmented Language Models Reason? The Blame Game Between the Retriever and the Language Model", EMNLP Findings 20…☆28Nov 2, 2023Updated 2 years ago
- Paper: Lexicon Learning for Few-Shot Neural Sequence Modeling☆16Jan 8, 2022Updated 4 years ago
- OSWorld-Human: Benchmarking the Efficiency of Computer-Use Agents☆24Jan 6, 2026Updated 4 months ago
- virtual node analysis on ogb benchmark dataset☆14Mar 9, 2023Updated 3 years ago
- Extracting Cultural Commonsense Knowledge at Scale (WWW 2023)☆11Feb 15, 2024Updated 2 years ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- FeedbackQA: Improving Question Answering Post-Deployment with Interactive Feedback☆12Jul 13, 2022Updated 3 years ago
- PyTorch reimplementation of REALM and ORQA☆22Feb 3, 2022Updated 4 years ago
- Hyperstar: Negative Sampling Improves Hypernymy Extraction Based on Projection Learning.☆24Jan 2, 2020Updated 6 years ago
- Attempt at reproducing a SGNN's projection layer, but with word n-grams instead of skip-grams. Paper and more: http://aclweb.org/antholog…☆22Nov 6, 2022Updated 3 years ago
- Code and Data for "Evaluating Correctness and Faithfulness of Instruction-Following Models for Question Answering"☆87Aug 12, 2024Updated last year
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- A basic, free, ad-less, PWA-ready, open-source QR Code generator☆18Aug 12, 2025Updated 8 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.☆28Dec 16, 2024Updated last year
- Implementation of the Mask R-CNN model using OCaml's numerical library Owl.☆19Jan 30, 2020Updated 6 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- customized dash NGL viewer☆12Jan 6, 2023Updated 3 years ago
- Project of ACL 2025 "UAlign: Leveraging Uncertainty Estimations for Factuality Alignment on Large Language Models"☆14Mar 25, 2025Updated last year
- Curated list of awesome ML Visualization Libraries☆14Jun 23, 2023Updated 2 years ago
- Official repository for WWW'24 paper "MemeCraft: Contextual and Stance-Driven Multimodal Meme Generation"☆12Jul 25, 2024Updated last year
- ACL 2023 paper "A Critical Evaluation of Evaluations for Long-form Question Answering"☆21Mar 22, 2024Updated 2 years ago
- AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM☆87Nov 3, 2024Updated last year
- ☆17Feb 17, 2025Updated last year
- ☆24Apr 29, 2026Updated last week
- [WSDM'2025] "MixRec: Heterogeneous Graph Collaborative Filtering"☆20Dec 19, 2024Updated last year
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- Code for "On Measuring Faithfulness of Natural Language Explanations"☆22Jul 23, 2024Updated last year
- [NDSS'25] The official implementation of safety misalignment.☆19Jan 8, 2025Updated last year
- ☆25Jan 5, 2026Updated 4 months ago
- Code for the paper "The Impact of Positional Encoding on Length Generalization in Transformers", NeurIPS 2023☆139Apr 30, 2024Updated 2 years ago
- [ICLR 2026] The official code for "Doxing via the Lens: Revealing Location-related Privacy Leakage on Multi-modal Large Reasoning Models"☆26Feb 7, 2026Updated 3 months ago
- ICML2025: One Image is Worth a Thousand Words: A Usability Preservable Text-Image Collaborative Erasing Framework☆15Jun 24, 2025Updated 10 months ago
- Emotion detection in audio utilising self-supervised representations trained with Contrastive Predictive Coding (CPC).☆43Feb 16, 2022Updated 4 years ago
- [SIGIR'22] Official PyTorch implementation for "Learning to Denoise Unreliable Interactions for Graph Collaborative Filtering".☆18Oct 24, 2022Updated 3 years ago
- [COLING 2025] Official repo of paper: "Not Aligned" is Not "Malicious": Being Careful about Hallucinations of Large Language Models' Jail…☆12Jul 26, 2024Updated last year
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- [COLING 2025🔥] Evolver: Chain-of-Evolution Prompting to Boost Large Multimodal Models for Hateful Meme Detection☆17Jan 21, 2025Updated last year
- Cloud Computing Mini Project☆25Oct 15, 2017Updated 8 years ago
- Official Repo of Your Agent May Misevolve: Emergent Risks in Self-evolving LLM Agents☆74Oct 28, 2025Updated 6 months ago
- ☆14Sep 4, 2024Updated last year
- Official GitHub repository of the lecture "Multimodal Deep Learning for Recommendation", at the 2024 ACM RecSys Summer School☆12Oct 12, 2024Updated last year
- Summary of recent news recommendation papers.☆25Feb 2, 2022Updated 4 years ago
- ☆14Dec 21, 2025Updated 4 months ago