LavinWong / Fairness-in-Large-Language-Models

Fairness in LLMs resources

☆22

Alternatives and similar repositories for Fairness-in-Large-Language-Models:

Users who are interested in Fairness-in-Large-Language-Models are comparing it to the repositories listed below.

KID-22 / LLM-IR-Bias-Fairness-Survey
This is the repo for the survey of Bias and Fairness in IR with LLMs.
☆52 Updated 3 weeks ago
AI4LIFE-GROUP / LLM_Explainer
Code for paper: Are Large Language Models Post Hoc Explainers?
☆31 Updated 9 months ago
i-gallegos / Fair-LLM-Benchmark
☆131 Updated last year
TimeLovercc / CAF-GNN
[CIKM 2023] Towards Fair Graph Neural Networks via Graph Counterfactual.
☆13 Updated last month
Arstanley / Awesome-Trustworthy-RAG
☆54 Updated last month
agiresearch / TrustAgent
TrustAgent: Towards Safe and Trustworthy LLM-based Agents
☆40 Updated 2 months ago
llm-misinformation / llm-misinformation
The dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?"
☆63 Updated 5 months ago
JacksonWuxs / UsableXAI_LLM
Using Explanations as a Tool for Advanced LLMs
☆60 Updated 7 months ago
jma712 / GEAR
☆18 Updated 3 years ago
SALT-NLP / Efficient_Unlearning
☆38 Updated last year
David-Li0406 / AI-Supervision-Risk
☆19 Updated last month
jinhaoduan / SAR
[ACL 2024] Shifting Attention to Relevance: Towards the Predictive Uncertainty Quantification of Free-Form Large Language Models
☆47 Updated 7 months ago
isail-laboratory / iDEA-iSAIL-Reading-Group
☆26 Updated this week
zepingyu0512 / awesome-SAE
awesome SAE papers
☆26 Updated 2 months ago
llm-misinformation / llm-misinformation-survey
Paper list for the survey "Combating Misinformation in the Age of LLMs: Opportunities and Challenges" and the initiative "LLMs Meet Misin…
☆99 Updated 5 months ago
LeMei / CausalNlp-ReadingGroup
repository for Causal&NLP works
☆10 Updated 2 months ago
pillowsofwind / Knowledge-Conflicts-Survey
[EMNLP 2024] The official GitHub repo for the survey paper "Knowledge Conflicts for LLMs: A Survey"
☆112 Updated 7 months ago
princeton-nlp / benign-data-breaks-safety
☆35 Updated 6 months ago
ventr1c / RES-GCL
An official PyTorch implementation of "Certifiably Robust Graph Contrastive Learning" (NeurIPS 2023)
☆10 Updated last year
changdaeoh / FarconVAE
Official implementation for KDD'22 paper "Learning Fair Representation via Distributional Contrastive Disentanglement"
☆23 Updated 2 years ago
Lingzhi-WANG / KGAUnlearn
☆16 Updated last year
snw2021 / LLM_Unlearning_Papers
☆26 Updated last year
jamqd / Group-Preference-Optimization
☆18 Updated last year
VITA-Group / LLaGA
[ICML2024] "LLaGA: Large Language and Graph Assistant", Runjin Chen, Tong Zhao, Ajay Jaiswal, Neil Shah, Zhangyang Wang
☆109 Updated 7 months ago
CharlesYu2000 / PCGU-UnlearningBias
☆16 Updated last year
vinid / safety-tuned-llamas
ICLR2024 Paper. Showing properties of safety tuning and exaggerated safety.
☆80 Updated 11 months ago
balevinstein / Probes
☆49 Updated last year
junchenzhi / Awesome-LLM-Ensemble
A curated list of Awesome-LLM-Ensemble papers for the survey "Harnessing Multiple Large Language Models: A Survey on LLM Ensemble"
☆37 Updated this week
ZhiningLiu1998 / SelfElicit
SelfElicit: Your Language Model Secretly Knows Where is the Relevant Evidence!
☆9 Updated 2 months ago
zepingyu0512 / neuron-attribution
code for EMNLP 2024 paper: Neuron-Level Knowledge Attribution in Large Language Models
☆30 Updated 5 months ago