☆22Oct 25, 2024Updated last year
Alternatives and similar repositories for AgentAttack
Users that are interested in AgentAttack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆190Oct 31, 2025Updated 8 months ago
- ☆32Feb 27, 2025Updated last year
- ☆40Oct 2, 2024Updated last year
- ☆13Nov 17, 2024Updated last year
- Code and dataset for the paper: "Can Editing LLMs Inject Harm?" [AAAI'26]☆21Dec 26, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [ICLR 2025] Dissecting adversarial robustness of multimodal language model agents☆139Feb 19, 2025Updated last year
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models☆280Jan 27, 2026Updated 5 months ago
- [CVPR2025] Official Repository for IMMUNE: Improving Safety Against Jailbreaks in Multi-modal LLMs via Inference-Time Alignment☆28Jun 11, 2025Updated last year
- [ICLR'24] RAIN: Your Language Models Can Align Themselves without Finetuning☆98May 23, 2024Updated 2 years ago
- ☆13May 18, 2024Updated 2 years ago
- [EMNLP 2024 Tutorial] Language Agents: Foundations, Prospects, and Risks☆10Nov 27, 2024Updated last year
- Official code for FAccT'21 paper "Fairness Through Robustness: Investigating Robustness Disparity in Deep Learning" https://arxiv.org/abs…☆13Mar 9, 2021Updated 5 years ago
- ☆102Mar 13, 2026Updated 3 months ago
- AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM☆87Nov 3, 2024Updated last year
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Code for our paper "Defending ChatGPT against Jailbreak Attack via Self-Reminder" in NMI.☆57Nov 13, 2023Updated 2 years ago
- The AgentForge project focuses on building general tooling to construct multicapability AI systems by composing skills and models togethe…☆18Oct 11, 2023Updated 2 years ago
- [ICML 2024] Safety Fine-Tuning at (Almost) No Cost: A Baseline for Vision Large Language Models.☆89Jan 19, 2025Updated last year
- A benchmark for evaluating the robustness of LLMs and defenses to indirect prompt injection attacks.☆138Apr 15, 2024Updated 2 years ago
- ☆22May 23, 2025Updated last year
- Pytorch implementation of NPAttack☆12Jul 7, 2020Updated 5 years ago
- Official repository for "Robust Prompt Optimization for Defending Language Models Against Jailbreaking Attacks"☆62Aug 8, 2024Updated last year
- Kubernetes cli (kubectl) powered by GPT☆15Apr 20, 2023Updated 3 years ago
- True Few-Shot BioIE: Benchmarking GPT-3 In-Context and Small PLM Fine-Tuning☆12Jul 6, 2022Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Repository for USTC 2021 Spring Database Labs☆10Jul 6, 2021Updated 4 years ago
- The Melbourne Open Data Playground GitHub community page is a part of the Melbourne Open Data Playground (MOP), an industry capstone proj…☆17Sep 10, 2025Updated 9 months ago
- Agent Security Bench (ASB)☆264Apr 16, 2026Updated 2 months ago
- Image Shortcut Squeezing: Countering Perturbative Availability Poisons with Compression☆14Mar 22, 2025Updated last year
- User-controllable Recommendation Against Filter Bubbles☆18May 4, 2022Updated 4 years ago
- Automated Question-Answering Over Knowledge Graphs in O&M of Wind Turbines☆14Aug 16, 2022Updated 3 years ago
- Make LLM can control your PC or Server with ssh or terminal.☆27Sep 17, 2025Updated 9 months ago
- ☆18Jan 3, 2025Updated last year
- [ICLR 2023, Spotlight] Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning☆31Dec 2, 2023Updated 2 years ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Spectral Graph Attention Network with Fast Eigen-approximation☆11Dec 24, 2021Updated 4 years ago
- ☆73Feb 16, 2025Updated last year
- [ICLR 2025] Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning☆23Jul 8, 2024Updated last year
- Symmetric Encryption with Language Models☆13Jun 13, 2023Updated 3 years ago
- The MobSTr dataset provides artifacts that demonstrate Model-based Safety Assurance and Traceability for a safety-critical automotive sys…☆10Mar 18, 2022Updated 4 years ago
- Implementation for <Understanding Robust Overftting of Adversarial Training and Beyond> in ICML'22.☆14Jul 1, 2022Updated 4 years ago
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema Expansion☆13Jul 26, 2023Updated 2 years ago