ethz-spylab / misleading-privacy-evals

Official code for "Evaluations of Machine Learning Privacy Defenses are Misleading" (https://arxiv.org/abs/2404.17399)

☆10

Alternatives and similar repositories for misleading-privacy-evals

Users who are interested in misleading-privacy-evals are comparing it to the repositories listed below

Vaidehi99 / InfoDeletionAttacks
☆44Updated 4 months ago
arumaekawa / DiLM
Implementation of "DiLM: Distilling Dataset into Language Model for Text-level Dataset Distillation" (accepted by NAACL2024 Findings).
☆21Updated 4 months ago
OPTML-Group / Unlearn-Sparse
[NeurIPS23 (Spotlight)] "Model Sparsity Can Simplify Machine Unlearning" by Jinghan Jia*, Jiancheng Liu*, Parikshit Ram, Yuguang Yao, Gao…
☆71Updated last year
fjxmlzn / private-evolution-papers
The collection of papers about Private Evolution
☆16Updated last week
Jayfeather1024 / Backdoor-Enhanced-Alignment
☆20Updated 6 months ago
reds-lab / projektor
This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (…
☆14Updated last year
zeyuanyin / tiny-imagenet
☆19Updated last year
MartinPawel / In-Context-Unlearning
"In-Context Unlearning: Language Models as Few Shot Unlearners". Martin Pawelczyk, Seth Neel* and Himabindu Lakkaraju*; ICML 2024.
☆26Updated last year
wagner-group / MarkMyWords
☆29Updated last year
yaojin17 / Unlearning_LLM
[ACL 2024] Code and data for "Machine Unlearning of Pre-trained Large Language Models"
☆59Updated 8 months ago
mireshghallah / neighborhood-curvature-mia
☆21Updated last year
git-disl / Vaccine
This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
☆44Updated 7 months ago
ejones313 / auditing-llms
☆54Updated 2 years ago
IBM / SafeLoRA
GitHub repo for NeurIPS 2024 paper "Safe LoRA: the Silver Lining of Reducing Safety Risks when Fine-tuning Large Language Models"
☆15Updated 8 months ago
VITA-Group / DP-OPT
[ICLR'24 Spotlight] DP-OPT: Make Large Language Model Your Privacy-Preserving Prompt Engineer
☆43Updated last year
ThuCCSLab / MergeGuard
[CCS-LAMPS'24] LLM IP Protection Against Model Merging
☆15Updated 8 months ago
git-disl / Booster
This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturba…
☆28Updated 3 months ago
phycholosogy / RAG-privacy
The code for paper "The Good and The Bad: Exploring Privacy Issues in Retrieval-Augmented Generation (RAG)", exploring the privacy risk o…
☆48Updated 4 months ago
ethz-spylab / rlhf-poisoning
Code for paper "Universal Jailbreak Backdoors from Poisoned Human Feedback"
☆55Updated last year
cnut1648 / Model-Fingerprint
Fingerprint large language models
☆38Updated 11 months ago
meghdadk / SCRUB
☆44Updated 10 months ago
pratyushmaini / llm_dataset_inference
Official Repository for Dataset Inference for LLMs
☆34Updated 11 months ago
inspire-group / DP-RandP
[NeurIPS 2023] Differentially Private Image Classification by Learning Priors from Random Processes
☆12Updated 2 years ago
THU-BPM / Robust_Watermark
Code and data for paper "A Semantic Invariant Robust Watermark for Large Language Models" accepted by ICLR 2024.
☆32Updated 7 months ago
s-ball-10 / jailbreak_dynamics
☆16Updated last year
chenchenygu / watermark-learnability
☆26Updated 4 months ago
litian96 / AdaDPS
Private Adaptive Optimization with Side Information (ICML '22)
☆16Updated 3 years ago
microsoft / dp-few-shot-generation
☆26Updated last year
ethz-spylab / unlearning-vs-safety
☆23Updated 8 months ago
papersPapers / BadPrompt
Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts"
☆36Updated 11 months ago