snwen123 / LLM_Unlearning_Papers
☆22 · Updated 9 months ago
Related projects:
- ☆32 · Updated 11 months ago
- Code & Data for our Paper "Alleviating Hallucinations of Large Language Models through Induced Hallucinations" · ☆56 · Updated 6 months ago
- RWKU: Benchmarking Real-World Knowledge Unlearning for Large Language Models · ☆50 · Updated 2 months ago
- Code for the ICLR'22 paper "On Robust Prefix-Tuning for Text Classification" · ☆26 · Updated 2 years ago
- Official code for the ICML 2024 paper on Persona In-Context Learning (PICLe) · ☆20 · Updated 2 months ago
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model · ☆59 · Updated last year
- [ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models" · ☆34 · Updated 4 months ago
- ☆32 · Updated 10 months ago
- ☆21 · Updated last year
- ☆14 · Updated 2 months ago
- ☆26 · Updated last year
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models · ☆67 · Updated last week
- Dataset and code for Multimodal Fact Checking and Explanation Generation (Mocheg) · ☆35 · Updated 9 months ago
- Official code implementation of SKU, accepted to ACL 2024 Findings · ☆11 · Updated 4 months ago
- Official repository for the ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models" · ☆64 · Updated 2 weeks ago
- [NeurIPS 2023] GitHub repository for "Composing Parameter-Efficient Modules with Arithmetic Operations" · ☆54 · Updated 9 months ago
- Methods and evaluation for aligning language models temporally · ☆24 · Updated 6 months ago
- ☆23 · Updated last year
- ☆23 · Updated last year
- Unofficial re-implementation of "Trusting Your Evidence: Hallucinate Less with Context-aware Decoding" · ☆25 · Updated 10 months ago
- Restore safety in fine-tuned language models through task arithmetic · ☆25 · Updated 5 months ago
- ☆21 · Updated 6 months ago
- [EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions · ☆96 · Updated last week
- Shadow Alignment: The Ease of Subverting Safely-Aligned Language Models · ☆22 · Updated 11 months ago
- ☆42 · Updated 5 months ago
- ☆21 · Updated 2 months ago
- ☆24 · Updated 4 months ago
- Evaluating the Ripple Effects of Knowledge Editing in Language Models · ☆45 · Updated 5 months ago
- Dataset and code for the ICLR 2024 paper "Can LLM-Generated Misinformation Be Detected?" · ☆45 · Updated last month
- Multilingual safety benchmark for Large Language Models · ☆21 · Updated 2 weeks ago