[EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
β16Sep 16, 2025Updated 6 months ago
Alternatives and similar repositories for ipiguard
Users that are interested in ipiguard are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- π Sliding Window Attention Training for Efficient Large Language Modelsβ16Dec 8, 2025Updated 3 months ago
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding.β13Nov 19, 2024Updated last year
- β40Feb 20, 2026Updated last month
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Modelβ14Dec 17, 2023Updated 2 years ago
- β15Sep 6, 2022Updated 3 years ago
- β18Nov 20, 2024Updated last year
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks".β55Mar 17, 2026Updated last week
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems"β13Dec 13, 2024Updated last year
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Modelsβ20Oct 24, 2024Updated last year
- Camouflage YOLO - (CAMOLO) trains adversarial patches to confuse the YOLO family of object detectors.β12Oct 20, 2022Updated 3 years ago
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24β¦β26Jun 14, 2024Updated last year
- β14Mar 9, 2025Updated last year
- Official release of code for the paper RL is a hammer and LLMs are nails A simple RL approach to stronger prompt injection attacksβ42Feb 11, 2026Updated last month
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20β¦β15Aug 12, 2024Updated last year
- β48Feb 8, 2025Updated last year
- From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systemsβ18Nov 23, 2025Updated 4 months ago
- This approach of Intrusion Detection uses two GPT models, which are trained on normal network traffic, to predict sequences of communicatβ¦β11Oct 3, 2023Updated 2 years ago
- [ICML 2025] Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactionsβ14Mar 7, 2026Updated 2 weeks ago
- β25Jul 27, 2024Updated last year
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.β488Mar 12, 2026Updated last week
- official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queriesβ65Nov 10, 2025Updated 4 months ago
- [NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agenβ¦β41Updated this week
- β15Sep 17, 2024Updated last year
- Benchmarking data and script used for LLM multi-agent collaboration systems from AWS Bedrock Agents Science team.β17Dec 10, 2024Updated last year
- Code of paper: xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking"β18Feb 17, 2026Updated last month
- β27Feb 25, 2025Updated last year
- A research workbench for developing and testing attacks against large language models, with a focus on prompt injection vulnerabilities aβ¦β44Updated this week
- Codebase for Instruction Following without Instruction Tuningβ36Sep 24, 2024Updated last year
- PFI: Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agentsβ27Mar 26, 2025Updated 11 months ago
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT.β26Dec 16, 2024Updated last year
- A systematic, AI-powered penetration testing reasoning engine (MCP server) for attack path planning, CTF/HTB solving, and automated penteβ¦β29Aug 30, 2025Updated 6 months ago
- β32Sep 11, 2025Updated 6 months ago
- Documenting large text datasets πΌοΈ πβ14Dec 17, 2024Updated last year
- This is a curated semantic version of the PASCAL-Part dataset for part-based object detection. Objects are aligned with WordNet and Yago β¦β14Jan 19, 2022Updated 4 years ago
- Collection of all the papers talking about/relevant to the topic of privacy-preserving LLMsβ41Feb 10, 2025Updated last year
- Code for "When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search" (NeurIPS 2024)β18Oct 22, 2024Updated last year
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"β91Jul 24, 2025Updated 8 months ago
- An example code of implement of PGD and FGSM algorithm for adversarial attackβ12Mar 3, 2022Updated 4 years ago
- β32Jun 28, 2025Updated 8 months ago