The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)
☆27Oct 31, 2022Updated 3 years ago
Alternatives and similar repositories for LM_PersonalInfoLeak
Users that are interested in LM_PersonalInfoLeak are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Oct 20, 2022Updated 3 years ago
- ☆53May 2, 2021Updated 5 years ago
- ☆22Sep 17, 2024Updated last year
- ☆40May 19, 2023Updated 3 years ago
- A collection of implementations of fair ML algorithms☆12Jan 7, 2018Updated 8 years ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Training data extraction on GPT-2☆195Feb 4, 2023Updated 3 years ago
- Source code for the paper "Exploiting Excessive Invariance caused by Norm-Bounded Adversarial Robustness"☆25Feb 12, 2020Updated 6 years ago
- ☆44Nov 17, 2024Updated last year
- ☆16Nov 30, 2022Updated 3 years ago
- A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020☆39Jul 10, 2022Updated 3 years ago
- [Preprint] Backdoor Attacks on Federated Learning with Lottery Ticket Hypothesis☆10Sep 23, 2021Updated 4 years ago
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Feb 20, 2023Updated 3 years ago
- ☆42May 23, 2023Updated 3 years ago
- [EMNLP 2022] Distillation-Resistant Watermarking (DRW) for Model Protection in NLP☆13Aug 17, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- [Preprint] On the Effectiveness of Mitigating Data Poisoning Attacks with Gradient Shaping☆10Feb 27, 2020Updated 6 years ago
- Code for "Differential Privacy Has Disparate Impact on Model Accuracy" NeurIPS'19☆33May 18, 2021Updated 5 years ago
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks☆21Sep 18, 2025Updated 9 months ago
- Code and dataset for the EMNLP 2024 paper: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory☆51Sep 26, 2024Updated last year
- ☆46Nov 10, 2019Updated 6 years ago
- Implementation of Adversarial Debiasing in PyTorch to address Gender Bias☆31Aug 5, 2020Updated 5 years ago
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations☆13Sep 11, 2024Updated last year
- ☆11Sep 20, 2019Updated 6 years ago
- Some templates for integrating Zotero, AI and Obsidian☆18Jul 29, 2024Updated last year
- Bare Metal GPUs on DigitalOcean Gradient AI • AdPurpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
- Pytorch implementation of backdoor unlearning.☆21Jun 8, 2022Updated 4 years ago
- Setu is a comprehensive pipeline designed to clean, filter, and deduplicate diverse data sources including Web, PDF, and Speech data. Bui…☆16May 17, 2024Updated 2 years ago
- [EMNLP2020] When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models☆11Nov 10, 2020Updated 5 years ago
- Camouflage poisoning via machine unlearning☆19Jul 3, 2025Updated 11 months ago
- ☆17Aug 13, 2020Updated 5 years ago
- Privacy attacks on Split Learning☆45Nov 15, 2021Updated 4 years ago
- Code for paper "Poisoned classifiers are not only backdoored, they are fundamentally broken"☆26Jan 7, 2022Updated 4 years ago
- The code of AAAI-21 paper titled "Defending against Backdoors in Federated Learning with Robust Learning Rate".☆38Oct 3, 2022Updated 3 years ago
- Repo for the paper "Bounding Training Data Reconstruction in Private (Deep) Learning".☆12Jun 16, 2023Updated 3 years ago
- GPU virtual machines on DigitalOcean Gradient AI • AdGet to production fast with high-performance AMD and NVIDIA GPUs you can spin up in seconds. The definition of operational simplicity.
- MCP Server für Deutsche Gesetzestexte☆46Dec 19, 2025Updated 5 months ago
- Filipino multi-modal NLP dataset. Consists of 350k+ Filipino news articles and associated images☆14Mar 11, 2025Updated last year
- ☆21Aug 19, 2024Updated last year
- [Talk] How to look like a statistician: a developer's guide to probabilistic programming☆10Sep 18, 2018Updated 7 years ago
- ☆10Sep 14, 2022Updated 3 years ago
- A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv☆15Dec 7, 2020Updated 5 years ago
- ☆14Jun 13, 2022Updated 4 years ago