junxu-ai / LLM_fairnessLinks

Collection of papers, tools, datasets for fairness of LLM

☆16

Alternatives and similar repositories for LLM_fairness

Users that are interested in LLM_fairness are comparing it to the libraries listed below

Sorting:

Xianjun-Yang / Awesome_papers_on_LLMs_detection
The lastest paper about detection of LLM-generated text and code
☆278Updated 3 months ago
i-gallegos / Fair-LLM-Benchmark
☆152Updated 2 years ago
LiuYuHan31 / FPS
☆20Updated last year
chrisliu298 / awesome-llm-unlearning
A resource repository for machine unlearning in large language models
☆495Updated 2 months ago
CryptoAILab / Awesome-LM-SSP
A reading list for large models safety, security, and privacy (including Awesome LLM Security, Safety, etc.).
☆1,695Updated this week
NLP2CT / DetectRL
[NeurIPS 2024 D&B] DetectRL: Benchmarking LLM-Generated Text Detection in Real-World Scenarios
☆33Updated 10 months ago
ICTMCG / LLM-for-misinformation-research
Paper list of misinformation research using (multi-modal) large language models, i.e., (M)LLMs.
☆294Updated 10 months ago
ydyjya / Awesome-LLM-Safety
A curated list of safety-related papers, articles, and resources focused on Large Language Models (LLMs). This repository aims to provide…
☆1,635Updated this week
git-disl / awesome_LLM-harmful-fine-tuning-papers
A survey on harmful fine-tuning attack for large language model
☆216Updated last week
zepingyu0512 / awesome-llm-understanding-mechanism
awesome papers in LLM interpretability
☆556Updated last month
ledllm / ledllm
☆21Updated last year
niconi19 / LLM-Conversation-Safety
[NAACL2024] Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey
☆106Updated last year
ICTMCG / Awesome-Machine-Generated-Text
Continuously updated list of related resources for generative LLMs like GPT and their analysis and detection.
☆226Updated 4 months ago
AmourWaltz / Reliable-LLM
☆165Updated last year
zepingyu0512 / awesome-LLM-neuron
☆30Updated 4 months ago
liudaizong / Awesome-LVLM-Attack
😎 up-to-date & curated list of awesome Attacks on Large-Vision-Language-Models papers, methods & resources.
☆398Updated this week
hzy312 / Awesome-LLM-Watermark
UP-TO-DATE LLM Watermark paper. 🔥🔥🔥
☆357Updated 10 months ago
Nicozwy / AIGTD-Survey
The official GitHub page for the survey paper of AIGTD entitled "The Imitation Game Revisited: A Comprehensive Survey on Recent Advances …
☆40Updated 7 months ago
DUT-lujunyu / ToxiCN
The code and resource of "Facilitating Fine-grained Detection of Chinese Toxic Language: Hierarchical Taxonomy, Resources, and Benchmark"…
☆86Updated 4 months ago
ydyjya / LLM-IHS-Explanation
☆53Updated last year
HillZhang1999 / llm-hallucination-survey
Reading list of hallucination in LLMs. Check out our new survey paper: "Siren’s Song in the AI Ocean: A Survey on Hallucination in Large …
☆1,051Updated 3 weeks ago
acl-org / acl-style-files
Official style files for papers submitted to venues of the Association for Computational Linguistics
☆1,223Updated last month
chawins / llm-sp
Papers and resources related to the security and privacy of LLMs 🤖
☆536Updated 4 months ago
zjunlp / KnowledgeEditingPapers
Must-read Papers on Knowledge Editing for Large Language Models.
☆1,173Updated 3 months ago
NY1024 / Foundation-Model-Paper-Notes
☆65Updated 4 months ago
Cartus / Automated-Fact-Checking-Resources
Links to conference/journal publications in automated fact-checking (resources for the TACL22/EMNLP23 paper).
☆534Updated 7 months ago
kevinyaobytedance / llm_unlearn
LLM Unlearning
☆175Updated last year
sleeepeer / PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
☆205Updated 7 months ago
ValueByte-AI / Awesome-LLM-in-Social-Science
Awesome papers involving LLMs in Social Science.
☆543Updated 3 weeks ago
TrustGen / TrustEval-toolkit
Toolkit for evaluating the trustworthiness of generative foundation models.
☆119Updated last month