IAAR-Shanghai/SafeRAG

☆60

Alternatives and similar repositories for SafeRAG

Users interested in SafeRAG are comparing it to the repositories listed below.

IAAR-Shanghai / SEAP
☆23 · Jun 10, 2025 · Updated last year
IAAR-Shanghai / PGRAG
PGRAG
☆53 · Jul 16, 2024 · Updated 2 years ago
sleeepeer / PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
☆285 · Jan 27, 2026 · Updated 5 months ago
HuichiZhou / TrustRAG
Code for "TrustRAG: Enhancing Robustness and Trustworthiness in RAG" (AAAI 2026 Workshop on Trust and Control in Agentic AI, TrustAgent)
☆60 · Mar 24, 2025 · Updated last year
inspire-group / RobustRAG
☆31 · Sep 15, 2024 · Updated last year
IAAR-Shanghai / ICSFSurvey
Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…
☆173 · Dec 7, 2024 · Updated last year
IAAR-Shanghai / Grimoire
Grimoire is All You Need for Enhancing Large Language Models
☆120 · Feb 29, 2024 · Updated 2 years ago
IAAR-Shanghai / xFinder
[ICLR 2025] xFinder: Large Language Models as Automated Evaluators for Reliable Evaluation
☆178 · Nov 14, 2025 · Updated 8 months ago
MemTensor / HaluMem
HaluMem is the first operation-level hallucination evaluation benchmark tailored to agent memory systems.
☆148 · Apr 30, 2026 · Updated 2 months ago
VMnK-Run / MARVEL
[ASE2024] Mutual Learning-Based Framework for Enhancing Robustness of Code Models via Adversarial Training
☆11 · Sep 13, 2024 · Updated last year
IAAR-Shanghai / CRUD_RAG
CRUD-RAG: A Comprehensive Chinese Benchmark for Retrieval-Augmented Generation of Large Language Models
☆399 · May 20, 2025 · Updated last year
HyeonjeongHa / MM-PoisonRAG
Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"
☆16 · Dec 4, 2025 · Updated 7 months ago
zhangbl6618 / RAG-Responsibility-Attribution
Official Implementation of "Who Taught the Lie? Responsibility Attribution for Poisoned Knowledge in Retrieval-Augmented Generation" and …
☆20 · Dec 24, 2025 · Updated 6 months ago
PKU-ML / PAT
Code for NeurIPS 2024 Paper "Fight Back Against Jailbreaking via Prompt Adversarial Tuning"
☆22 · May 6, 2025 · Updated last year
hannxu123 / fair_robust
☆12 · Apr 27, 2022 · Updated 4 years ago
Arstanley / Awesome-Trustworthy-RAG
☆116 · Jun 1, 2026 · Updated last month
OSU-NLP-Group / EIA_against_webagent
☆40 · Oct 2, 2024 · Updated last year
jonnypei / acl23-preadd
☆12 · Jul 25, 2023 · Updated 2 years ago
bigglesworthnotacat / LLM-Steg
[ICLR 2026 Oral] Invisible Safety Threat: Malicious Finetuning for LLM via Steganography
☆20 · Mar 22, 2026 · Updated 3 months ago
LauJames / Topic-FlipRAG
[USENIX Security 2025] Topic-FlipRAG: Topic-Orientated Adversarial Opinion Manipulation Attacks to Retrieval-Augmented Generation Models
☆15 · Jun 21, 2025 · Updated last year
roywang021 / IDEATOR
Code for ICCV2025 paper "IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves"
☆18 · Jul 11, 2025 · Updated last year
Ki-Seki / Awesome-Transformer-Visualization
Explore visualization tools for understanding Transformer-based large language models (LLMs)
☆27 · Dec 1, 2024 · Updated last year
CTZhou-byte / TrojanRAG
☆24 · Jan 6, 2025 · Updated last year
real-absolute-AI / Unnatural_Language
The official repository of 'Unnatural Language Are Not Bugs but Features for LLMs'
☆24 · May 20, 2025 · Updated last year
LFYSec / AgentFuzz
The source code of [Sec'25] Make Agent Defeat Agent: Automatic Detection of Taint-Style Vulnerabilities in LLM-based Agents
☆95 · Apr 13, 2026 · Updated 3 months ago
aifinlab / Spider-Sense
☆21 · Feb 6, 2026 · Updated 5 months ago
brian-lou / Training-Data-Extraction-Attack-on-LLMs
This project explores training data extraction attacks on the LLaMa 7B, GPT-2XL, and GPT-2-IMDB models to discover memorized content usin…
☆15 · Jun 15, 2023 · Updated 3 years ago
chichidd / llm-lora-trojan
Code for paper "The Philosopher’s Stone: Trojaning Plugins of Large Language Models"
☆33 · Sep 11, 2024 · Updated last year
MemTensor / text2mem
Text2Mem: A Unified Memory Operation Language for Memory Operating System
☆55 · Jan 7, 2026 · Updated 6 months ago
OSU-NLP-Group / AmpleGCG
AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM
☆87 · Nov 3, 2024 · Updated last year
flowvqa / flowvqa
The official dataset of the flowvqa project.
☆24 · Mar 26, 2024 · Updated 2 years ago
wzh99 / GenCoG
GenCoG: A DSL-Based Approach to Generating Computation Graphs for TVM Testing (ISSTA'23)
☆17 · Jul 19, 2023 · Updated 3 years ago
TrustMLRG / GASP
GASP: Efficient Black-Box Generation of Adversarial Suffixes for Jailbreaking LLMs
☆16 · Nov 12, 2025 · Updated 8 months ago
NicerWang / Joint-GCG
[AAAI 2026] Official implementation of "Joint-GCG: Unified Gradient-Based Poisoning Attacks on Retrieval-Augmented Generation Systems".
☆18 · Mar 23, 2026 · Updated 3 months ago
jxnl / mit-lecture
☆10 · Feb 25, 2025 · Updated last year
shiningrain / JailGuard
☆32 · Mar 16, 2025 · Updated last year
kangmintong / R-2-Guard
[ICLR 2025] Code implementation of R^2-Guard: Robust Reasoning Enabled LLM Guardrail via Knowledge-Enhanced Logical Reasoning
☆23 · Jul 8, 2024 · Updated 2 years ago
Tsinghua-dhy / EDC-2-RAG
☆19 · Nov 3, 2025 · Updated 8 months ago
cnlinxi / LLM-paper-daily
Automatically update LLM papers daily using GitHub Actions. Ref: https://github.com/Vincentqyw/cv-arxiv-daily
☆10 · Updated this week