Source code of NAACL 2025 Findings "Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models"
☆15 · Dec 16, 2025 · Updated 2 months ago
Alternatives and similar repositories for mia-scaling
Users who are interested in mia-scaling are comparing it to the libraries listed below
- Official Repository for Dataset Inference for LLMs ☆42 · Jul 25, 2024 · Updated last year
- ☆15 · Apr 4, 2024 · Updated last year
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks ☆19 · Sep 18, 2025 · Updated 5 months ago
- ☆17 · Jul 18, 2024 · Updated last year
- ☆13 · Oct 20, 2022 · Updated 3 years ago
- ☆21 · May 23, 2025 · Updated 9 months ago
- Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models ☆30 · Oct 6, 2025 · Updated 4 months ago
- Public implementation of the paper "On the Importance of Difficulty Calibration in Membership Inference Attacks". ☆16 · Dec 1, 2021 · Updated 4 years ago
- A Synthetic Dataset for Personal Attribute Inference (NeurIPS'24 D&B) ☆52 · Jul 27, 2025 · Updated 7 months ago
- This is an official repository for Practical Membership Inference Attacks Against Large-Scale Multi-Modal Models: A Pilot Study (ICCV2023… ☆24 · Sep 29, 2023 · Updated 2 years ago
- Python package for measuring memorization in LLMs. ☆183 · Jul 16, 2025 · Updated 7 months ago
- [NeurIPS'24] "Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration" ☆200 · Mar 13, 2025 · Updated 11 months ago
- ☆10 · Nov 6, 2020 · Updated 5 years ago
- This repository contains the source code for "Membership Inference Attacks as Privacy Tools: Reliability, Disparity and Ensemble", In Pro… ☆10 · Jan 2, 2026 · Updated 2 months ago
- ☆38 · Nov 24, 2021 · Updated 4 years ago
- A Model Agnostic function to directly remove specified layers from the LLM ☆10 · May 23, 2024 · Updated last year
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati… ☆10 · Sep 23, 2023 · Updated 2 years ago
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns ☆13 · Mar 1, 2025 · Updated last year
- A RAG that can scale 🧑🏻💻 ☆11 · May 28, 2024 · Updated last year
- Object recognition with Pepper using a deep learning model ☆10 · Sep 16, 2021 · Updated 4 years ago
- Self-Teaching Notes on Gradient Leakage Attacks against GPT-2 models. ☆14 · Mar 18, 2024 · Updated last year
- Website for release of TellMeWhy dataset for why question answering ☆14 · Nov 11, 2022 · Updated 3 years ago
- Code for running forward and backward versions of GPT2 ☆10 · Nov 20, 2021 · Updated 4 years ago
- ☆11 · Jul 7, 2023 · Updated 2 years ago
- [AAAI 2024] Data-Free Hard-Label Robustness Stealing Attack ☆14 · Mar 29, 2024 · Updated last year
- Federated Conformal Prediction with Quantile-of-Quantiles (FedCP-QQ) ☆11 · Aug 16, 2023 · Updated 2 years ago
- NLPBench: Evaluating NLP-Related Problem-solving Ability in Large Language Models ☆10 · Oct 27, 2023 · Updated 2 years ago
- ☆14 · Jun 24, 2024 · Updated last year
- snn implementation for spike-deeplab and spike-fcn ☆12 · Dec 31, 2022 · Updated 3 years ago
- Code for the paper "Overconfidence is a Dangerous Thing: Mitigating Membership Inference Attacks by Enforcing Less Confident Prediction" … ☆12 · Sep 6, 2023 · Updated 2 years ago
- Shadow Attack, LiRA, Quantile Regression and RMIA implementations in PyTorch (Online version) ☆14 · Nov 8, 2024 · Updated last year
- Script for using Bing chat like a meal delivery service. ☆12 · Mar 15, 2023 · Updated 2 years ago
- A tool for extracting, modifying, and crafting ASDM binary packages (CVE-2022-20829) ☆13 · Aug 15, 2022 · Updated 3 years ago
- Causal Reasoning for Membership Inference Attacks ☆11 · Oct 21, 2022 · Updated 3 years ago
- FairGrad, is an easy to use general purpose approach to enforce fairness for gradient descent based methods. ☆14 · Oct 2, 2023 · Updated 2 years ago
- A list where most values will be None (or default) ☆11 · Jul 19, 2023 · Updated 2 years ago
- Official implement of ACL'25 Findings paper "MMUnlearner: Reformulating Multimodal Machine Unlearning in the Era of Multimodal Large Lang… ☆19 · Jun 17, 2025 · Updated 8 months ago
- Full List of Bad Words and Top Swear Words Banned by Google. As they closed the api ☆12 · Sep 26, 2018 · Updated 7 years ago
- ☆12 · Apr 24, 2024 · Updated last year