KaiyuanZh / SOFT
View external linksLinks

[USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks

☆19

Alternatives and similar repositories for SOFT

Users that are interested in SOFT are comparing it to the libraries listed below

Sorting:

SolidShen / RIPPLE_official
View on GitHub
☆20Feb 11, 2024Updated 2 years ago
Gwinhen / MOTH
View on GitHub
This is the implementation for IEEE S&P 2022 paper "Model Orthogonalization: Class Distance Hardening in Neural Networks for Better Secur…
☆11Aug 24, 2022Updated 3 years ago
XZ-X / PEM
View on GitHub
☆15Dec 29, 2023Updated 2 years ago
Gwinhen / DRUPE
View on GitHub
Distribution Preserving Backdoor Attack in Self-supervised Learning
☆20Jan 27, 2024Updated 2 years ago
PurduePAML / DBS
View on GitHub
☆18Aug 15, 2022Updated 3 years ago
Megum1 / BEAGLE
View on GitHub
[NDSS'23] BEAGLE: Forensics of Deep Learning Backdoor Attack for Better Defense
☆17May 7, 2024Updated last year
Gwinhen / BackdoorVault
View on GitHub
A toolbox for backdoor attacks.
☆23Jan 13, 2023Updated 3 years ago
AISIGSJTU / Siren
View on GitHub
Siren: Byzantine-robust Federated Learning via Proactive Alarming (SoCC '21)
☆11Mar 28, 2024Updated last year
PurduePAML / Exray
View on GitHub
☆12May 27, 2022Updated 3 years ago
zhaisf / CLiD
View on GitHub
[NeurIPS 2024] "Membership Inference on Text-to-image Diffusion Models via Conditional Likelihood Discrepancy"
☆12Sep 15, 2025Updated 5 months ago
lvpeizhuo / MEA-Defender
View on GitHub
This is the source code for MEA-Defender. Our paper is accepted by the IEEE Symposium on Security and Privacy (S&P) 2024.
☆29Nov 19, 2023Updated 2 years ago
KaiyuanZh / FLIP
View on GitHub
[ICLR 2023, Best Paper Award at ECCV’22 AROW Workshop] FLIP: A Provable Defense Framework for Backdoor Mitigation in Federated Learning
☆60Dec 11, 2024Updated last year
parameterlab / mia-scaling
View on GitHub
Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"
☆15Dec 16, 2025Updated 2 months ago
Megum1 / LOTUS
View on GitHub
[CVPR'24] LOTUS: Evasive and Resilient Backdoor Attacks through Sub-Partitioning
☆15Jan 15, 2025Updated last year
yxoh / prompt_leak_usenix2024
View on GitHub
☆14May 8, 2024Updated last year
RJ-T / NIPS2022_EP_BNP
View on GitHub
Official Implementation of NIPS 2022 paper Pre-activation Distributions Expose Backdoor Neurons
☆15Jan 13, 2023Updated 3 years ago
Lyz1213 / BadEdit
View on GitHub
☆37Oct 17, 2024Updated last year
ziansu / codeart
View on GitHub
Official repo for FSE'24 paper "CodeArt: Better Code Models by Attention Regularization When Symbols Are Lacking"
☆18Mar 10, 2025Updated 11 months ago
yfchen1994 / poisoning_membership
View on GitHub
☆20Oct 28, 2025Updated 3 months ago
Gwinhen / HardBeat
View on GitHub
This is the repository for USENIX Security 2023 paper "Hard-label Black-box Universal Adversarial Patch Attack".
☆15Sep 5, 2023Updated 2 years ago
GiantSeaweed / DECREE
View on GitHub
Official repository for CVPR'23 paper: Detecting Backdoors in Pre-trained Encoders
☆36Sep 25, 2023Updated 2 years ago
eth-sri / llm-anonymization
View on GitHub
☆21May 23, 2025Updated 8 months ago
PurduePAML / K-ARM_Backdoor_Optimization
View on GitHub
☆18Jun 15, 2021Updated 4 years ago
Megum1 / ODSCAN
View on GitHub
[IEEE S&P'24] ODSCAN: Backdoor Scanning for Object Detection Models
☆20Oct 5, 2025Updated 4 months ago
ZhangZhuoSJTU / LINT
View on GitHub
☆17Sep 4, 2024Updated last year
facebookresearch / calibration_membership
View on GitHub
Public implementation of the paper "On the Importance of Difficulty Calibration in Membership Inference Attacks".
☆16Dec 1, 2021Updated 4 years ago
SolidShen / BAIT
View on GitHub
🔥🔥🔥 Detecting hidden backdoors in Large Language Models with only black-box access
☆52Jun 2, 2025Updated 8 months ago
jianshuod / TBA
View on GitHub
Official code for the ICCV2023 paper ``One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training''
☆20Aug 9, 2023Updated 2 years ago
tianshuocong / TePA
View on GitHub
[S&P'24] Test-Time Poisoning Attacks Against Test-Time Adaptation Models
☆19Feb 18, 2025Updated 11 months ago
ganeshdg95 / Leveraging-Adversarial-Examples-to-Quantify-Membership-Information-Leakage
View on GitHub
☆19Mar 6, 2023Updated 2 years ago
xaddwell / TFLlib
View on GitHub
TFLlib-Trustworthy Federated Learning Library and Benchmark
☆62Nov 15, 2025Updated 3 months ago
zwang84 / zsdb3kd
View on GitHub
Knowledge distillation (KD) from a decision-based black-box (DB3) teacher without training data.
☆22May 3, 2022Updated 3 years ago
David-Li0406 / AI-Supervision-Risk
View on GitHub
☆21Mar 17, 2025Updated 10 months ago
purdue-hcss / SecureChain
View on GitHub
☆46Sep 3, 2025Updated 5 months ago
PurduePAML / PICCOLO
View on GitHub
☆26Dec 1, 2022Updated 3 years ago
zealscott / SynMeter
View on GitHub
A principled library for tuning, training and evaluating tabular data synthesis on fidelity, privacy and utility. CCS 2025.
☆26Aug 17, 2025Updated 5 months ago
Gwinhen / PixelBackdoor
View on GitHub
This is the implementation for CVPR 2022 Oral paper "Better Trigger Inversion Optimization in Backdoor Scanning."
☆24Apr 5, 2022Updated 3 years ago
njuaplusplus / mirror
View on GitHub
Code for NDSS 2022 paper "MIRROR: Model Inversion for Deep Learning Network with High Fidelity"
☆27May 9, 2023Updated 2 years ago
jeffhj / LM_PersonalInfoLeak
View on GitHub
The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)
☆28Oct 31, 2022Updated 3 years ago

KaiyuanZh / SOFTView external linksLinks

Alternatives and similar repositories for SOFT

KaiyuanZh / SOFT
View external linksLinks