jeffhj / LM_PersonalInfoLeakLinks

The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)

☆24

Alternatives and similar repositories for LM_PersonalInfoLeak

Users that are interested in LM_PersonalInfoLeak are comparing it to the libraries listed below

Sorting:

ftramer / LM_Memorization
Training data extraction on GPT-2
☆193Updated 2 years ago
mireshghallah / ft-memorization
☆13Updated 3 years ago
pratyushmaini / llm_dataset_inference
Official Repository for Dataset Inference for LLMs
☆43Updated last year
wyshi / lm_privacy
☆21Updated 4 years ago
parameterlab / mia-scaling
Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"
☆15Updated 9 months ago
leix28 / prompt-universal-vulnerability
Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022
☆31Updated 3 years ago
xiangyue9607 / Sentence-LDP
Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"
☆12Updated 2 years ago
shreyansh26 / Extracting-Training-Data-from-Large-Langauge-Models
A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020
☆37Updated 3 years ago
AlexWan0 / Poisoning-Instruction-Tuned-Models
☆56Updated last year
facebookresearch / text-adversarial-attack
Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"
☆109Updated 2 years ago
google-research / lm-extraction-benchmark
☆293Updated 3 months ago
ejones313 / auditing-llms
☆59Updated 2 years ago
csong27 / collision-bert
☆25Updated 5 years ago
microsoft / analysing_pii_leakage
The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…
☆101Updated last year
VITA-Group / DP-OPT
[ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer
☆46Updated last year
lxuechen / private-transformers
A codebase that makes differentially private training of transformers easy.
☆178Updated 2 years ago
weichen-yu / LM-Extraction
☆43Updated 2 years ago
iamgroot42 / mimir
Python package for measuring memorization in LLMs.
☆173Updated 4 months ago
huseyinatahaninan / Differentially-Private-Fine-tuning-of-Language-Models
☆76Updated 3 years ago
microsoft / dp-transformers
Differentially-private transformers using HuggingFace and Opacus
☆143Updated last year
amazon-science / controlling-llm-memorization
☆38Updated 2 years ago
Vaidehi99 / InfoDeletionAttacks
☆47Updated 9 months ago
centerforaisafety / tdc2023-starter-kit
This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
☆89Updated last year
mireshghallah / neighborhood-curvature-mia
☆23Updated 2 years ago
ebagdasa / propaganda_as_a_service
Code for paper: "Spinning Language Models: Risks of Propaganda-as-a-Service and Countermeasures"
☆21Updated 3 years ago
skywalker023 / confaide
🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con…
☆48Updated last year
declare-lab / red-instruct
Codes and datasets of the paper Red-Teaming Large Language Models using Chain of Utterances for Safety-Alignment
☆107Updated last year
aaronmueller / MIB
Landing page for MIB: A Mechanistic Interpretability Benchmark
☆21Updated 3 months ago
eth-sri / SynthPAI
A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)
☆45Updated 3 months ago
neulab / RIPPLe
Code for the paper "Weight Poisoning Attacks on Pre-trained Models" (ACL 2020)
☆142Updated last month