sleeepeer/PoisonedRAG

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/sleeepeer/PoisonedRAG)

sleeepeer / PoisonedRAG

[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models

☆234

Alternatives and similar repositories for PoisonedRAG

Users that are interested in PoisonedRAG are comparing it to the libraries listed below

Sorting:

princeton-nlp / corpus-poisoning
View on GitHub
[EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156
☆48Dec 14, 2023Updated 2 years ago
inspire-group / RobustRAG
View on GitHub
☆23Sep 15, 2024Updated last year
AI-secure / AgentPoison
View on GitHub
[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"
☆199Apr 12, 2025Updated 10 months ago
liu00222 / Open-Prompt-Injection
View on GitHub
This repository provides a benchmark for prompt injection attacks and defenses in LLMs
☆395Oct 29, 2025Updated 4 months ago
OSU-NLP-Group / AgentAttack
View on GitHub
☆23Oct 25, 2024Updated last year
chichidd / llm-lora-trojan
View on GitHub
Code for paper "The Philosopher’s Stone: Trojaning Plugins of Large Language Models"
☆27Sep 11, 2024Updated last year
lancopku / agent-backdoor-attacks
View on GitHub
Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]
☆109Sep 27, 2024Updated last year
GraySwanAI / nanoGCG
View on GitHub
A fast + lightweight implementation of the GCG algorithm in PyTorch
☆318May 13, 2025Updated 9 months ago
iliaishacked / sponge_examples
View on GitHub
☆28Oct 14, 2021Updated 4 years ago
ethz-spylab / rlhf-poisoning
View on GitHub
Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"
☆66Apr 24, 2024Updated last year
koo-ec / Fault_Tree_Simulink
View on GitHub
MATLAB Simulink Based Fault Tree Analyzer
☆10Mar 12, 2024Updated last year
HyeonjeongHa / MM-PoisonRAG
View on GitHub
Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"
☆12Dec 4, 2025Updated 2 months ago
RaoulHeese / qshaptools
View on GitHub
Experimental toolbox for quantum Shapley values.
☆10Jan 2, 2024Updated 2 years ago
DependableSystemsLab / MIA_defense_HAMP
View on GitHub
Code for the paper "Overconfidence is a Dangerous Thing: Mitigating Membership Inference Attacks by Enforcing Less Confident Prediction" …
☆12Sep 6, 2023Updated 2 years ago
multimodalbags / BAGS_Multimodal
View on GitHub
Backdooring Multimodal Learning
☆30May 4, 2023Updated 2 years ago
agiresearch / ASB
View on GitHub
Agent Security Bench (ASB)
☆186Oct 27, 2025Updated 4 months ago
SproutNan / AI-Safety_Benchmark
View on GitHub
The official repository for guided jailbreak benchmark
☆28Jul 28, 2025Updated 7 months ago
Sizhe-Chen / StruQ
View on GitHub
official implementation of [USENIX Sec'25] StruQ: Defending Against Prompt Injection with Structured Queries
☆63Nov 10, 2025Updated 3 months ago
CTZhou-byte / TrojanRAG
View on GitHub
☆17Jan 6, 2025Updated last year
NY1024 / Foundation-Model-Paper-Notes
View on GitHub
☆75Jan 21, 2026Updated last month
OSU-NLP-Group / AmpleGCG
View on GitHub
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM
☆83Nov 3, 2024Updated last year
Snakinya / MCPCorpus
View on GitHub
MCPCorpus is a comprehensive dataset for analyzing the Model Context Protocol (MCP) ecosystem, containing ~14K MCP servers and 300 MCP cl…
☆32Sep 1, 2025Updated 6 months ago
Arstanley / Awesome-Trustworthy-RAG
View on GitHub
☆102Jul 6, 2025Updated 7 months ago
ethz-spylab / agentdojo
View on GitHub
A Dynamic Environment to Evaluate Attacks and Defenses for LLM Agents.
☆443Feb 3, 2026Updated 3 weeks ago
ydyjya / Awesome-LLM-Safety
View on GitHub
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide…
☆1,783Feb 1, 2026Updated last month
MexicanLemonade / LLM-Misinfo-QA
View on GitHub
This repository contains data and code used for On the Risk of Misinformation Pollution with Large Language Models (EMNLP 2023 Findings).
☆16Dec 14, 2023Updated 2 years ago
chawins / llm-sp
View on GitHub
Papers and resources related to the security and privacy of LLMs 🤖
☆566Jun 8, 2025Updated 8 months ago
rucnyz / LeakAgent
View on GitHub
☆29Aug 31, 2025Updated 6 months ago
sherdencooper / PromptFuzz
View on GitHub
☆29Oct 23, 2024Updated last year
HuichiZhou / TrustRAG
View on GitHub
Code for "TrustRAG: Enhancing Robustness and Trustworthiness in RAG" AAAI 2026 Workshop on Trust and Control in Agentic AI (TrustAgent)
☆52Mar 24, 2025Updated 11 months ago
koo-ec / ECDF-based-Distance-Measure
View on GitHub
A set of functions for well-known Cumulative Distribution Function (CDF)-based distance measure
☆15Jan 5, 2024Updated 2 years ago
CryptoAILab / Awesome-LM-SSP
View on GitHub
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
☆1,870Updated this week
Gwinhen / DRUPE
View on GitHub
Distribution Preserving Backdoor Attack in Self-supervised Learning
☆20Jan 27, 2024Updated 2 years ago
TangciuYueng / AMemGuard
View on GitHub
☆38Oct 12, 2025Updated 4 months ago
KaiyuanZh / OrthogLinearBackdoor
View on GitHub
[Oakland 2024] Exploring the Orthogonality and Linearity of Backdoor Attacks
☆27Apr 15, 2025Updated 10 months ago
facebookresearch / SecAlign
View on GitHub
Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"
☆86Jul 24, 2025Updated 7 months ago
MKYucel / hybrid_augment
View on GitHub
[ICCV 2023] HybridAugment++: Unified Frequency Spectra Perturbations for Model Robustness
☆17Sep 28, 2023Updated 2 years ago
liuyugeng / ML-Doctor
View on GitHub
Code for ML Doctor
☆92Aug 14, 2024Updated last year
Explain3D / LIME-3D
View on GitHub
☆17Aug 17, 2021Updated 4 years ago