[EMNLP 2025 Oral] IPIGuard: A Novel Tool Dependency Graph-Based Defense Against Indirect Prompt Injection in LLM Agents
★18 · Sep 16, 2025 · Updated 6 months ago
Alternatives and similar repositories for ipiguard
Users that are interested in ipiguard are comparing it to the libraries listed below.
- Source code for the ACL'2025 paper titled "Unveiling privacy risks in llm agent memory" ★29 · Dec 2, 2025 · Updated 4 months ago
- Sliding Window Attention Training for Efficient Large Language Models ★16 · Dec 8, 2025 · Updated 4 months ago
- Fast and Slow Generating: An Empirical Study on Large and Small Language Models Collaborative Decoding. ★13 · Nov 19, 2024 · Updated last year
- [AAAI 2024] History Matters: Temporal Knowledge Editing in Large Language Model ★14 · Dec 17, 2023 · Updated 2 years ago
- ★44 · Apr 7, 2026 · Updated last week
- ★18 · Nov 20, 2024 · Updated last year
- ★15 · Sep 6, 2022 · Updated 3 years ago
- Codes for our paper "AgentMonitor: A Plug-and-Play Framework for Predictive and Secure Multi-Agent Systems" ★13 · Dec 13, 2024 · Updated last year
- [ACL 2024 Findings] Learning Fine-Grained Grounded Citations for Attributed Large Language Models ★20 · Oct 24, 2024 · Updated last year
- Repo for the paper "Meta SecAlign: A Secure Foundation LLM Against Prompt Injection Attacks". ★59 · Apr 1, 2026 · Updated last week
- Camouflage YOLO - (CAMOLO) trains adversarial patches to confuse the YOLO family of object detectors. ★12 · Oct 20, 2022 · Updated 3 years ago
- Implementation of paper 'Reversing the Forget-Retain Objectives: An Efficient LLM Unlearning Framework from Logit Difference' [NeurIPS'24… ★26 · Jun 14, 2024 · Updated last year
- ★14 · Mar 9, 2025 · Updated last year
- Official resource for paper Investigating and Mitigating the Multimodal Hallucination Snowballing in Large Vision-Language Models (ACL 20… ★15 · Aug 12, 2024 · Updated last year
- ★48 · Feb 8, 2025 · Updated last year
- Official release of code for the paper RL is a hammer and LLMs are nails A simple RL approach to stronger prompt injection attacks ★43 · Updated this week
- From Hypothesis to Publication: A Comprehensive Survey of AI-Driven Research Support Systems ★18 · Nov 23, 2025 · Updated 4 months ago
- This approach of Intrusion Detection uses two GPT models, which are trained on normal network traffic, to predict sequences of communicat… ★11 · Oct 3, 2023 · Updated 2 years ago
- [ICML 2025] Speak Easy: Eliciting Harmful Jailbreaks from LLMs with Simple Interactions ★14 · Mar 7, 2026 · Updated last month
- ★16 · Sep 17, 2024 · Updated last year
- A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents. ★515 · Mar 30, 2026 · Updated 2 weeks ago
- Benchmarking data and script used for LLM multi-agent collaboration systems from AWS Bedrock Agents Science team. ★18 · Dec 10, 2024 · Updated last year
- official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries ★68 · Nov 10, 2025 · Updated 5 months ago
- Code of paper: "xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking" ★18 · Apr 3, 2026 · Updated last week
- ★27 · Feb 25, 2025 · Updated last year
- Codebase for Instruction Following without Instruction Tuning ★36 · Sep 24, 2024 · Updated last year
- ★31 · Jul 27, 2024 · Updated last year
- [NeurIPS 2025] The official implementation of the paper "DRIFT: Dynamic Rule-Based Defense with Injection Isolation for Securing LLM Agen… ★46 · Mar 19, 2026 · Updated 3 weeks ago
- PFI: Prompt Flow Integrity to Prevent Privilege Escalation in LLM Agents ★27 · Mar 26, 2025 · Updated last year
- A research workbench for developing and testing attacks against large language models, with a focus on prompt injection vulnerabilities a… ★47 · Updated this week
- We introduce EMMET and unify model editing with popular algorithms ROME and MEMIT. ★27 · Dec 16, 2024 · Updated last year
- A systematic, AI-powered penetration testing reasoning engine (MCP server) for attack path planning, CTF/HTB solving, and automated pente… ★30 · Aug 30, 2025 · Updated 7 months ago
- ★32 · Sep 11, 2025 · Updated 7 months ago
- Documenting large text datasets ★14 · Dec 17, 2024 · Updated last year
- This is a curated semantic version of the PASCAL-Part dataset for part-based object detection. Objects are aligned with WordNet and Yago … ★14 · Jan 19, 2022 · Updated 4 years ago
- Code for "When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search" (NeurIPS 2024) ★18 · Oct 22, 2024 · Updated last year
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization" ★95 · Updated this week
- Collection of all the papers talking about/relevant to the topic of privacy-preserving LLMs ★42 · Feb 10, 2025 · Updated last year
- An example code of implement of PGD and FGSM algorithm for adversarial attack ★12 · Mar 3, 2022 · Updated 4 years ago