jeffhj / LM_PersonalInfoLeak
The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)
☆23 · Updated 2 years ago
Alternatives and similar repositories for LM_PersonalInfoLeak:
Users who are interested in LM_PersonalInfoLeak are comparing it to the repositories listed below.
- ☆13 · Updated 2 years ago
- In-context Example Selection with Influences · ☆15 · Updated last year
- Official Repository for Dataset Inference for LLMs · ☆33 · Updated 9 months ago
- A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020 · ☆35 · Updated 2 years ago
- ☆54 · Updated 2 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" (Findings of NAACL 2022) · ☆29 · Updated 2 years ago
- ☆24 · Updated 4 years ago
- ☆27 · Updated 4 years ago
- ☆9 · Updated 4 years ago
- ☆42 · Updated 3 months ago
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings) · ☆26 · Updated 3 years ago
- ☆17 · Updated 4 years ago
- ☆18 · Updated 3 years ago
- ☆42 · Updated last year
- ☆21 · Updated last year
- ☆89 · Updated last week
- ☆35 · Updated last year
- Frequency-Guided Word Substitutions for Detecting Textual Adversarial Examples (EACL 2021) · ☆8 · Updated 4 years ago
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy" · ☆11 · Updated 2 years ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data" · ☆38 · Updated 2 years ago
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models · ☆81 · Updated 7 months ago
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition. · ☆86 · Updated 11 months ago
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models · ☆61 · Updated 2 years ago
- Röttger et al. (NAACL 2024): "XSTest: A Test Suite for Identifying Exaggerated Safety Behaviours in Large Language Models" · ☆95 · Updated 2 months ago
- "Understanding Dataset Difficulty with V-Usable Information" (ICML 2022, outstanding paper) · ☆86 · Updated last year
- ☆42 · Updated last year
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888 · ☆35 · Updated 10 months ago
- Explaining neural decisions contrastively to alternative decisions. · ☆25 · Updated 4 years ago
- ☆51 · Updated 4 years ago
- 🤫 Code and benchmark for our ICLR 2024 spotlight paper: "Can LLMs Keep a Secret? Testing Privacy Implications of Language Models via Con… · ☆42 · Updated last year