microsoft / analysing_pii_leakageLinks

The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word prediction language models.

☆101

Alternatives and similar repositories for analysing_pii_leakage

Users that are interested in analysing_pii_leakage are comparing it to the libraries listed below

Sorting:

safr-ai-lab / survey-llm
A survey of privacy problems in Large Language Models (LLMs). Contains summary of the corresponding paper along with relevant code
☆68Updated last year
microsoft / dp-transformers
Differentially-private transformers using HuggingFace and Opacus
☆143Updated last year
huseyinatahaninan / Differentially-Private-Fine-tuning-of-Language-Models
☆75Updated 3 years ago
byerose / Awesome-Foundation-Model-Security
A curated list of trustworthy Generative AI papers. Daily updating...
☆74Updated last year
eth-sri / llmprivacy
☆69Updated 8 months ago
QinbinLi / LLM-PBE
A toolkit to assess data privacy in LLMs (under development)
☆62Updated 9 months ago
lancopku / agent-backdoor-attacks
Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]
☆93Updated last year
iamgroot42 / mimir
Python package for measuring memorization in LLMs.
☆170Updated 3 months ago
arobey1 / smooth-llm
☆111Updated last year
VITA-Group / DP-OPT
[ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer
☆46Updated last year
microsoft / dp-few-shot-generation
☆28Updated last year
phycholosogy / RAG-privacy
The code for paper "The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)", exploring the privacy risk o…
☆55Updated 8 months ago
sleeepeer / PoisonedRAG
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
☆209Updated 8 months ago
wegodev2 / virtual-prompt-injection
Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"
☆23Updated last year
facebookresearch / SecAlign
Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"
☆72Updated 3 months ago
thunlp / OpenBackdoor
An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)
☆196Updated 2 years ago
CryptoAILab / JailbreakEval
[NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.
☆172Updated 6 months ago
lxuechen / private-transformers
A codebase that makes differentially private training of transformers easy.
☆176Updated 2 years ago
facebookresearch / advprompter
Official implementation of AdvPrompter https//arxiv.org/abs/2404.16873
☆168Updated last year
AI-secure / DecodingTrust
A Comprehensive Assessment of Trustworthiness in GPT Models
☆303Updated last year
uw-nsl / SafeDecoding
Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding
☆146Updated last year
Vaidehi99 / InfoDeletionAttacks
☆46Updated 8 months ago
Princeton-SysML / Jailbreak_LLM
☆185Updated last year
ftramer / LM_Memorization
Training data extraction on GPT-2
☆193Updated 2 years ago
poloclub / llm-self-defense
LLM Self Defense: By Self Examination, LLMs know they are being tricked
☆43Updated last year
BHui97 / PLeak
☆65Updated 10 months ago
AI-secure / aug-pe
[ICML 2024 Spotlight] Differentially Private Synthetic Data via Foundation Model APIs 2: Text
☆46Updated 9 months ago
Yu-Fangxu / COLD-Attack
[ICML 2024] COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
☆166Updated 10 months ago
Django-Jiang / BadChain
[ICLR24] Official Repo of BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models
☆39Updated last year
AI-secure / AgentPoison
[NeurIPS 2024] Official implementation for "AgentPoison: Red-teaming LLM Agents via Memory or Knowledge Base Backdoor Poisoning"
☆160Updated 6 months ago