The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)
☆27Oct 31, 2022Updated 3 years ago
Alternatives and similar repositories for LM_PersonalInfoLeak
Users that are interested in LM_PersonalInfoLeak are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- ☆13Oct 20, 2022Updated 3 years ago
- ☆53May 2, 2021Updated 5 years ago
- ☆22Sep 17, 2024Updated last year
- Training data extraction on GPT-2☆195Feb 4, 2023Updated 3 years ago
- ☆304Apr 8, 2026Updated last month
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆44Nov 17, 2024Updated last year
- ☆16Nov 30, 2022Updated 3 years ago
- Supplementary code for the paper "UnSplit: Data-Oblivious Model Inversion, Model Stealing, and Label Inference Attacks Against Split Lear…☆15Nov 10, 2022Updated 3 years ago
- Code for the WWW'23 paper "Sanitizing Sentence Embeddings (and Labels) for Local Differential Privacy"☆12Feb 20, 2023Updated 3 years ago
- Code for NDSS '25 paper "Passive Inference Attacks on Split Learning via Adversarial Regularization"☆13Sep 16, 2024Updated last year
- GAN you see me? enhanced data reconstruction attacks against split inference - NeurIPS 2023☆12Mar 26, 2025Updated last year
- Synthesize a new dataset based on the original dataset for later machine learning to facilitate the data sharing from the customer.☆17Feb 4, 2025Updated last year
- Code for "Differential Privacy Has Disparate Impact on Model Accuracy" NeurIPS'19☆33May 18, 2021Updated 4 years ago
- [USENIX Security 2025] SOFT: Selective Data Obfuscation for Protecting LLM Fine-tuning against Membership Inference Attacks☆20Sep 18, 2025Updated 7 months ago
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- Code and dataset for the EMNLP 2024 paper: GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory☆49Sep 26, 2024Updated last year
- Implementation of Adversarial Debiasing in PyTorch to address Gender Bias☆31Aug 5, 2020Updated 5 years ago
- ☆15Feb 21, 2024Updated 2 years ago
- [ACL'24] WebCiteS: Attributed Query-Focused Summarization on Chinese Web Search Results with Citations☆13Sep 11, 2024Updated last year
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred…☆104Aug 13, 2024Updated last year
- ☆11Sep 20, 2019Updated 6 years ago
- Some templates for integrating Zotero, AI and Obsidian☆18Jul 29, 2024Updated last year
- analysis of public NLP corpora☆11Feb 9, 2023Updated 3 years ago
- Pytorch implementation of backdoor unlearning.☆21Jun 8, 2022Updated 3 years ago
- Deploy open-source AI quickly and easily - Special Bonus Offer • AdRunpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
- [EMNLP2020] When Hearst Is not Enough: Improving Hypernymy Detection from Corpus with Distributional Models☆11Nov 10, 2020Updated 5 years ago
- Camouflage poisoning via machine unlearning☆19Jul 3, 2025Updated 10 months ago
- Privacy attacks on Split Learning☆45Nov 15, 2021Updated 4 years ago
- Code for paper "Poisoned classifiers are not only backdoored, they are fundamentally broken"☆26Jan 7, 2022Updated 4 years ago
- Official code for the ICCV2023 paper ``One-bit Flip is All You Need: When Bit-flip Attack Meets Model Training''☆20Aug 9, 2023Updated 2 years ago
- Filipino multi-modal NLP dataset. Consists of 350k+ Filipino news articles and associated images☆14Mar 11, 2025Updated last year
- ☆12Oct 23, 2022Updated 3 years ago
- ☆27Dec 15, 2022Updated 3 years ago
- ☆21Aug 19, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆10Sep 14, 2022Updated 3 years ago
- A simple script for extracting plain text from arxiv dataset: https://www.kaggle.com/Cornell-University/arxiv☆15Dec 7, 2020Updated 5 years ago
- ☆14Jun 13, 2022Updated 3 years ago
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23May 8, 2023Updated 3 years ago
- ☆57Oct 4, 2024Updated last year
- ☆11Mar 27, 2021Updated 5 years ago
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆36Oct 15, 2023Updated 2 years ago