shreyansh26 / Extracting-Training-Data-from-Large-Langauge-ModelsLinks

A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020

☆37

Alternatives and similar repositories for Extracting-Training-Data-from-Large-Langauge-Models

Users that are interested in Extracting-Training-Data-from-Large-Langauge-Models are comparing it to the libraries listed below

Sorting:

ftramer / LM_Memorization
Training data extraction on GPT-2
☆193Updated 2 years ago
jeffhj / LM_PersonalInfoLeak
The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)
☆24Updated 3 years ago
google-research / lm-extraction-benchmark
☆293Updated 3 months ago
neulab / RIPPLe
Code for the paper "Weight Poisoning Attacks on Pre-trained Models" (ACL 2020)
☆142Updated last month
wyshi / lm_privacy
☆21Updated 4 years ago
mireshghallah / ft-memorization
☆13Updated 3 years ago
amazon-science / controlling-llm-memorization
☆38Updated 2 years ago
VITA-Group / DP-OPT
[ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer
☆46Updated last year
leix28 / prompt-universal-vulnerability
Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022
☆31Updated 3 years ago
microsoft / analysing_pii_leakage
The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…
☆101Updated last year
AlexWan0 / Poisoning-Instruction-Tuned-Models
☆56Updated last year
pratyushmaini / llm_dataset_inference
Official Repository for Dataset Inference for LLMs
☆43Updated last year
csong27 / collision-bert
☆25Updated 5 years ago
facebookresearch / text-adversarial-attack
Repo for arXiv preprint "Gradient-based Adversarial Attacks against Text Transformers"
☆109Updated 2 years ago
parameterlab / mia-scaling
Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"
☆15Updated 9 months ago
microsoft / dp-transformers
Differentially-private transformers using HuggingFace and Opacus
☆143Updated last year
lancopku / Embedding-Poisoning
Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…
☆43Updated 4 years ago
Hazelsuko07 / TextHide
TextHide: Tackling Data Privacy in Language Understanding Tasks
☆31Updated 4 years ago
lxuechen / private-transformers
A codebase that makes differentially private training of transformers easy.
☆178Updated 2 years ago
thunlp / StyleAttack
Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer"
☆46Updated 3 years ago
alvinchangw / CARA_EMNLP2020
Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)
☆15Updated 5 years ago
weichen-yu / LM-Extraction
☆43Updated 2 years ago
ALFA-group / adversarial-code-generation
[ICLR 2021] "Generating Adversarial Computer Programs using Optimized Obfuscations" by Shashank Srikant, Sijia Liu, Tamara Mitrovska, Shi…
☆30Updated 4 years ago
QData / deepWordBug
CodeBase for Paper: "Black-box Generation of Adversarial Text Sequences to Evade Deep Learning Classifiers", / Interactive Demo @
☆79Updated 2 years ago
safr-ai-lab / survey-llm
A survey of privacy problems in Large Language Models (LLMs). Contains summary of the corresponding paper along with relevant code
☆68Updated last year
Princeton-SysML / kNNLM_privacy
Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888
☆37Updated last year
azshue / AutoPoison
The official repository of the paper "On the Exploitability of Instruction Tuning".
☆65Updated last year
huseyinatahaninan / Differentially-Private-Fine-tuning-of-Language-Models
☆76Updated 3 years ago
HKUST-KnowComp / GEIA
Code for Findings-ACL 2023 paper: Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Rec…
☆48Updated last year
xiangyue9607 / SanText
Code for Findings of ACL 2021 "Differential Privacy for Text Analytics via Natural Text Sanitization"
☆29Updated 3 years ago