jeffhj / LM_PersonalInfoLeak
The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)
☆18Updated 2 years ago
Alternatives and similar repositories for LM_PersonalInfoLeak:
Users that are interested in LM_PersonalInfoLeak are comparing it to the libraries listed below
- ☆9Updated 4 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022☆29Updated 2 years ago
- ☆18Updated 3 years ago
- Official Repository for Dataset Inference for LLMs☆32Updated 7 months ago
- ☆11Updated 2 years ago
- ☆27Updated 4 years ago
- Explaining neural decisions contrastively to alternative decisions.☆24Updated 4 years ago
- ☆41Updated last month
- A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020☆35Updated 2 years ago
- Code for paper: "Spinning Language Models: Risks of Propaganda-as-a-Service and Countermeasures"☆21Updated 2 years ago
- ☆24Updated 3 years ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆26Updated 3 years ago
- Implementation for Poison Attacks against Text Datasets with Conditional Adversarially Regularized Autoencoder (EMNLP-Findings 2020)☆15Updated 4 years ago
- In-context Example Selection with Influences☆15Updated last year
- ☆6Updated 2 years ago
- IPython notebook with synthetic experiments for AFLite, based on the ICML 2020 paper, "Adversarial Filters of Dataset Biases".☆16Updated 4 years ago
- Code for "Imitation Attacks and Defenses for Black-box Machine Translations Systems"☆36Updated 4 years ago
- ☆87Updated last year
- Code for the paper "Be Careful about Poisoned Word Embeddings: Exploring the Vulnerability of the Embedding Layers in NLP Models" (NAACL-…☆38Updated 3 years ago
- Frequency-Guided Word Substitutions for Detecting Textual Adversarial Examples (EACL 2021)☆8Updated 3 years ago
- ☆21Updated last year
- ☆25Updated 3 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- Code for paper "Search Methods for Sufficient, Socially-Aligned Feature Importance Explanations with In-Distribution Counterfactuals"☆17Updated 2 years ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888☆35Updated 9 months ago
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆32Updated 3 years ago
- ☆41Updated last year
- Code for "Evaluating Explainable AI: Which Algorithmic Explanations Help Users Predict Model Behavior?"☆46Updated last year
- ☆53Updated 9 months ago