Arstanley / Awesome-Trustworthy-RAGLinks

☆79

Alternatives and similar repositories for Awesome-Trustworthy-RAG

Users that are interested in Awesome-Trustworthy-RAG are comparing it to the libraries listed below

Sorting:

llm-misinformation / llm-misinformation-survey
Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misin…
☆103Updated 11 months ago
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆138Updated last year
llm-misinformation / llm-misinformation
The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"
☆76Updated 11 months ago
KID-22 / LLM-IR-Bias-Fairness-Survey
This is the repo for the survey of Bias and Fairness in IR with LLMs.
☆57Updated last month
kevinyaobytedance / llm_unlearn
LLM Unlearning
☆175Updated last year
wang2226 / FOLK
[EMNLP 2023] Explainable Claim Verification via Knowledge-Grounded Reasoning with Large Language Models
☆24Updated last year
OSU-NLP-Group / AgentSafety
☆114Updated 5 months ago
AmourWaltz / Reliable-LLM
☆163Updated last year
TrustGen / TrustEval-toolkit
Toolkit for evaluating the trustworthiness of generative foundation models.
☆119Updated last month
LuckyyySTA / Awesome-LLM-hallucination
LLM hallucination paper list
☆323Updated last year
JiahongLiu21 / Awesome-Personalized-Large-Language-Models
☆86Updated last week
OpenSafetyLab / SALAD-BENCH
【ACL 2024】 SALAD benchmark & MD-Judge
☆161Updated 7 months ago
KID-22 / LLM-Unlearning-Paper-List
☆28Updated last year
Lordog / R-Judge
R-Judge: Benchmarking Safety Risk Awareness for LLM Agents (EMNLP Findings 2024)
☆89Updated 4 months ago
Xianjun-Yang / Awesome_papers_on_LLMs_detection
The lastest paper about detection of LLM-generated text and code
☆277Updated 3 months ago
HITsz-TMG / awesome-llm-attributions
A Survey of Attributions for Large Language Models
☆216Updated last year
ZFancy / awesome-activation-engineering
A curated list of resources for activation engineering
☆105Updated last week
niconi19 / LLM-Conversation-Safety
[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
☆106Updated last year
jinzhuoran / RWKU
RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models. NeurIPS 2024
☆83Updated last year
ICTMCG / Awesome-Machine-Generated-Text
Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.
☆226Updated 4 months ago
ZhiningLiu1998 / SelfElicit
[ACL'25 Main] SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence! | 让你的LLM更好地利用上下文文档：一个基于注意力的简单方案
☆23Updated 7 months ago
thu-coai / Agent-SafetyBench
☆63Updated 2 months ago
florin-git / The-Power-of-Noise
Code and data for "The Power of Noise: Redefining Retrieval for RAG Systems"
☆63Updated 3 months ago
RUCAIBox / HaluEval-2.0
☆47Updated last year
wangywUST / DeepEdit
Repository for our paper "DeepEdit: Knowledge Editing as Decoding with Constraints". https://arxiv.org/abs/2401.10471
☆21Updated last year
LaMP-Benchmark / LaMP
Codes for papers on Large Language Models Personalization (LaMP)
☆170Updated 7 months ago
usail-hkust / Jailjudge
JAILJUDGE: A comprehensive evaluation benchmark which includes a wide range of risk scenarios with complex malicious prompts (e.g., synth…
☆52Updated 9 months ago
JacksonWuxs / UsableXAI_LLM
Using Explanations as a Tool for Advanced LLMs
☆67Updated last year
agiresearch / TrustAgent
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
☆53Updated 8 months ago
weizhepei / InstructRAG
[ICLR 2025] InstructRAG: Instructing Retrieval-Augmented Generation via Self-Synthesized Rationales
☆124Updated 8 months ago