computationalprivacy / mia_llms_benchmarkLinks

Benchmarking MIAs against LLMs.

☆22

Alternatives and similar repositories for mia_llms_benchmark

Users that are interested in mia_llms_benchmark are comparing it to the libraries listed below

Sorting:

parameterlab / mia-scaling
Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"
☆14Updated 8 months ago
QinbinLi / LLM-PBE
A toolkit to assess data privacy in LLMs (under development)
☆62Updated 9 months ago
iamgroot42 / mimir
Python package for measuring memorization in LLMs.
☆167Updated 2 months ago
centerforaisafety / tdc2023-starter-kit
This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.
☆89Updated last year
huseyinatahaninan / Differentially-Private-Fine-tuning-of-Language-Models
☆75Updated 3 years ago
ftramer / LM_Memorization
Training data extraction on GPT-2
☆192Updated 2 years ago
aounon / certified-llm-safety
☆44Updated last year
lxuechen / private-transformers
A codebase that makes differentially private training of transformers easy.
☆176Updated 2 years ago
jthickstun / watermark
Code for watermarking language models
☆82Updated last year
microsoft / dp-transformers
Differentially-private transformers using HuggingFace and Opacus
☆143Updated last year
ejones313 / auditing-llms
☆58Updated 2 years ago
Princeton-SysML / FILM
Official repo for the paper: Recovering Private Text in Federated Learning of Language Models (in NeurIPS 2022)
☆60Updated 2 years ago
pratyushmaini / llm_dataset_inference
Official Repository for Dataset Inference for LLMs
☆41Updated last year
locuslab / acr-memorization
☆37Updated 9 months ago
eth-sri / SynthPAI
A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)
☆44Updated 2 months ago
microsoft / dp-few-shot-generation
☆28Updated last year
wyshi / lm_privacy
☆21Updated 4 years ago
mireshghallah / ft-memorization
☆13Updated 2 years ago
microsoft / analysing_pii_leakage
The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…
☆101Updated last year
wagner-group / MarkMyWords
☆30Updated last year
Vaidehi99 / InfoDeletionAttacks
☆46Updated 8 months ago
VITA-Group / DP-OPT
[ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer
☆46Updated last year
eth-sri / llmprivacy
☆68Updated 7 months ago
sophie-xhonneux / Continuous-AdvTrain
☆29Updated last month
locuslab / open-unlearning
[NeurIPS D&B '25] The one-stop repository for large language model (LLM) unlearning. Supports TOFU, MUSE, WMDP, and many unlearning metho…
☆384Updated this week
eth-sri / lamp
LAMP: Extracting Text from Gradients with Language Model Priors (NeurIPS '22)
☆26Updated 4 months ago
ykwon0407 / DataInf
DataInf: Efficiently Estimating Data Influence in LoRA-tuned LLMs and Diffusion Models (ICLR 2024)
☆76Updated last year
safr-ai-lab / survey-llm
A survey of privacy problems in Large Language Models (LLMs). Contains summary of the corresponding paper along with relevant code
☆68Updated last year
facebookresearch / advprompter
Official implementation of AdvPrompter https//arxiv.org/abs/2404.16873
☆166Updated last year
princeton-polaris-lab / Evaluating-Durable-Safeguards
[ICLR 2025] On Evluating the Durability of Safegurads for Open-Weight LLMs
☆13Updated 3 months ago