IAAR-Shanghai / SafeRAG
☆23Updated 3 weeks ago
Alternatives and similar repositories for SafeRAG:
Users that are interested in SafeRAG are comparing it to the libraries listed below
- PGRAG☆47Updated 8 months ago
- 【ACL 2024】 SALAD benchmark & MD-Judge☆136Updated 3 weeks ago
- Official github repo for AutoDetect, an automated weakness detection framework for LLMs.☆42Updated 9 months ago
- ☆35Updated 2 months ago
- [ICLR 2025] This is the code repo for our ICLR’25 paper "RAG-DDR: Optimizing Retrieval-Augmented Generation Using Differentiable Data Rew…☆31Updated last month
- The code of arxiv paper: "CoT-based Synthesizer: Enhancing LLM Performance through Answer Synthesis"☆23Updated 2 months ago
- Code Repo for EfficientRAG: Efficient Retriever for Multi-Hop Question Answering☆43Updated 3 weeks ago
- S-Eval: Automatic and Adaptive Test Generation for Benchmarking Safety Evaluation of Large Language Models☆58Updated last month
- [NDSS'25 Poster] A collection of automated evaluators for assessing jailbreak attempts.☆133Updated 3 weeks ago
- Explore concepts like Self-Correct, Self-Refine, Self-Improve, Self-Contradict, Self-Play, and Self-Knowledge, alongside o1-like reasonin…☆164Updated 3 months ago
- The demo, code and data of FollowRAG☆70Updated 3 months ago
- ☆18Updated 2 weeks ago
- ☆47Updated last month
- Small Models, Big Insights: Leveraging Slim Proxy Models To Decide When and What to Retrieve for LLMs (ACL 2024)☆60Updated 5 months ago
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding☆124Updated 8 months ago
- The code and data of DPA-RAG☆58Updated 2 months ago
- ☆15Updated 9 months ago
- Controllable Text Generation for Large Language Models: A Survey☆164Updated 7 months ago
- Fast Memorization of Prompt Improves Context Awareness of Large Language Models (Findings of EMNLP 2024)☆20Updated 5 months ago
- [ACL 2024] An Easy-to-use Hallucination Detection Framework for LLMs.☆30Updated last month
- ☆138Updated 2 weeks ago
- R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)☆68Updated last month
- [ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales☆80Updated last month
- Official github repo for SafetyBench, a comprehensive benchmark to evaluate LLMs' safety. [ACL 2024]☆208Updated 9 months ago
- ☆123Updated 7 months ago
- Welcome to the Table Meets LLM repository!☆32Updated 2 months ago
- ☆82Updated last week
- ShieldLM: Empowering LLMs as Aligned, Customizable and Explainable Safety Detectors [EMNLP 2024 Findings]☆179Updated 6 months ago
- [NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"☆105Updated 2 months ago
- Code for the 2024 arXiv publication "Fine-Tuning with Divergent Chains of Thought Boosts Reasoning Through Self-Correction in Language Mo…☆23Updated 8 months ago