jeffhj / LM_PersonalInfoLeak
The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)
☆18Updated 2 years ago
Alternatives and similar repositories for LM_PersonalInfoLeak:
Users that are interested in LM_PersonalInfoLeak are comparing it to the libraries listed below
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆29Updated 2 years ago
- ☆11Updated 2 years ago
- In-context Example Selection with Influences☆15Updated last year
- ☆9Updated 4 years ago
- ☆18Updated 3 years ago
- Official Repository for Dataset Inference for LLMs☆31Updated 6 months ago
- ☆27Updated 4 years ago
- A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020☆35Updated 2 years ago
- Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)☆15Updated 4 years ago
- ☆41Updated last week
- ☆31Updated last year
- ☆24Updated 3 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆26Updated 3 years ago
- ☆6Updated 2 years ago
- ☆21Updated last year
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".☆16Updated 4 years ago
- ☆87Updated last year
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated last year
- Explaining neural decisions contrastively to alternative decisions.☆24Updated 3 years ago
- Causal Reasoning for Membership Inference Attacks☆10Updated 2 years ago
- A framework for adversarial attacks against token classification models☆32Updated 3 years ago
- Natural Universal Trigger Search (NUTS)☆21Updated 3 years ago
- A Diagnostic Study of Explainability Techniques for Text Classification☆66Updated 4 years ago
- ☆17Updated 3 years ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆22Updated last year
- ☆25Updated 3 years ago
- "Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper)☆84Updated last year
- ☆31Updated last year
- Code for paper: "Spinning Language Models: Risks of Propaganda-as-a-Service and Countermeasures"☆21Updated 2 years ago