eth-sri/llmprivacy

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/eth-sri/llmprivacy)

eth-sri / llmprivacy

☆71

Alternatives and similar repositories for llmprivacy

Users that are interested in llmprivacy are comparing it to the libraries listed below

Sorting:

eth-sri / privacy-inference-multimodal
View on GitHub
☆20Feb 3, 2025Updated last year
eth-sri / llm-anonymization
View on GitHub
☆21May 23, 2025Updated 9 months ago
eth-sri / SynthPAI
View on GitHub
A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B)
☆52Jul 27, 2025Updated 7 months ago
niuliang42 / CodexLeaks
View on GitHub
CodexLeaks: Privacy Leaks from Code Generation Language Models in GitHub Copilot
☆11Jul 11, 2023Updated 2 years ago
mireshghallah / ft-memorization
View on GitHub
☆13Oct 20, 2022Updated 3 years ago
Lyz1213 / BadEdit
View on GitHub
☆37Oct 17, 2024Updated last year
thu-coai / Targeted-Data-Extraction
View on GitHub
Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…
☆23May 8, 2023Updated 2 years ago
zh1yu4nyu / CodeIPPrompt
View on GitHub
https://icml.cc/virtual/2023/poster/24354
☆10Aug 15, 2023Updated 2 years ago
ChenWu98 / agent-attack
View on GitHub
[ICLR 2025] Dissecting adversarial robustness of multimodal language model agents
☆130Feb 19, 2025Updated last year
fzwark / Secure_LLM_System
View on GitHub
☆14Mar 9, 2025Updated 11 months ago
microsoft / analysing_pii_leakage
View on GitHub
The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…
☆104Aug 13, 2024Updated last year
RylanSchaeffer / AstraFellowship-When-Do-VLM-Image-Jailbreaks-Transfer
View on GitHub
Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models
☆37Jun 1, 2025Updated 9 months ago
rucnyz / LeakAgent
View on GitHub
☆29Aug 31, 2025Updated 6 months ago
moranant / attacking_distributed_learning
View on GitHub
An implementation for the paper "A Little Is Enough: Circumventing Defenses For Distributed Learning" (NeurIPS 2019)
☆29Jun 29, 2023Updated 2 years ago
wang2226 / Trojan-Activation-Attack
View on GitHub
[CIKM 2024] Trojan Activation Attack: Attack Large Language Models using Activation Steering for Safety-Alignment.
☆29Jul 29, 2024Updated last year
KaiyuanZh / SOFT
View on GitHub
[USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks
☆20Sep 18, 2025Updated 5 months ago
jyhong836 / llm-dp-finetune
View on GitHub
End-to-end codebase for finetuning LLMs (LLaMA 2, 3, etc.) with or without DP
☆16Sep 23, 2024Updated last year
AI-secure / PolyGuard
View on GitHub
☆17Jun 18, 2025Updated 8 months ago
thestephencasper / explore_establish_exploit_llms
View on GitHub
☆31Jul 14, 2023Updated 2 years ago
inspire-group / tta_risk
View on GitHub
☆14Jun 6, 2023Updated 2 years ago
wbopan / safety-residual-space
View on GitHub
☆21Mar 20, 2025Updated 11 months ago
yxoh / prompt_leak_usenix2024
View on GitHub
☆14May 8, 2024Updated last year
Yu-Fangxu / COLD-Attack
View on GitHub
[ICML 2024] COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability
☆176Dec 18, 2024Updated last year
chawins / llm-sp
View on GitHub
Papers and resources related to the security and privacy of LLMs 🤖
☆568Jun 8, 2025Updated 8 months ago
tedbackdoordefense / ted
View on GitHub
☆23Dec 14, 2023Updated 2 years ago
illidanlab / inversion-influence-function
View on GitHub
Official codes for "Understanding Deep Gradient Leakage via Inversion Influence Functions", NeurIPS 2023
☆15Oct 13, 2023Updated 2 years ago
facebookresearch / wasp
View on GitHub
Official implementation of the WASP web agent security benchmark
☆71Aug 12, 2025Updated 6 months ago
Gwinhen / DRUPE
View on GitHub
Distribution Preserving Backdoor Attack in Self-supervised Learning
☆20Jan 27, 2024Updated 2 years ago
pasquini-dario / EludingSecureAggregation
View on GitHub
Eluding Secure Aggregation in Federated Learning via Model Inconsistency
☆13Mar 10, 2023Updated 2 years ago
Sadcardation / MLLM-Refusal
View on GitHub
Repository for the Paper: Refusing Safe Prompts for Multi-modal Large Language Models
☆18Oct 16, 2024Updated last year
tribhuvanesh / prediction-poisoning
View on GitHub
Prediction Poisoning: Towards Defenses Against DNN Model Stealing Attacks (ICLR '20)
☆33Nov 4, 2020Updated 5 years ago
Leagein / memr3
View on GitHub
☆69Jan 18, 2026Updated last month
qizhangli / MoreBayesian-attack
View on GitHub
Code for our ICLR 2023 paper Making Substitute Models More Bayesian Can Enhance Transferability of Adversarial Examples.
☆18May 31, 2023Updated 2 years ago
huseyinatahaninan / Differentially-Private-Fine-tuning-of-Language-Models
View on GitHub
☆78May 28, 2022Updated 3 years ago
DPamK / BadAgent
View on GitHub
☆29Feb 27, 2025Updated last year
tech-srl / adversarial-examples
View on GitHub
Code for the paper: "Adversarial Examples for Models of Code"
☆18Nov 16, 2020Updated 5 years ago
researchcode001 / daca
View on GitHub
Divide-and-Conquer Attack: Harnessing the Power of LLM to Bypass the Censorship of Text-to-Image Generation Mode
☆18Feb 16, 2025Updated last year
reds-lab / ASSET
View on GitHub
This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…
☆19Jun 7, 2023Updated 2 years ago
purpcode-uiuc / purpcode
View on GitHub
🔮Reasoning for Safer Code Generation; 🥇Winner Solution of Amazon Nova AI Challenge 2025
☆35Aug 24, 2025Updated 6 months ago