amazon-science / controlling-llm-memorizationView external linksLinks
☆39May 19, 2023Updated 2 years ago
Alternatives and similar repositories for controlling-llm-memorization
Users that are interested in controlling-llm-memorization are comparing it to the libraries listed below
Sorting:
- Official Code for ACL 2023 paper: "Ethicist: Targeted Training Data Extraction Through Loss Smoothed Soft Prompting and Calibrated Confid…☆23May 8, 2023Updated 2 years ago
- ☆13Oct 20, 2022Updated 3 years ago
- ☆300Jan 13, 2026Updated last month
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22)☆27Oct 31, 2022Updated 3 years ago
- ☆15Feb 21, 2024Updated last year
- Training data extraction on GPT-2☆196Feb 4, 2023Updated 3 years ago
- ☆25Aug 18, 2023Updated 2 years ago
- ☆43May 23, 2023Updated 2 years ago
- ☆28Aug 31, 2025Updated 5 months ago
- ☆10Jun 5, 2021Updated 4 years ago
- Code for Dissecting Generation Modes for Abstractive Summarization Models via Ablation and Attribution (ACL2021)☆13Jun 2, 2021Updated 4 years ago
- ☆14May 8, 2024Updated last year
- Uses gpt-2 to find all completions of a sentence over a certain probability threshold.☆13Mar 17, 2020Updated 5 years ago
- Repo for "Zemi: Learning Zero-Shot Semi-Parametric Language Models from Multiple Tasks" ACL 2023 Findings☆15May 3, 2023Updated 2 years ago
- A re-implementation of the "Extracting Training Data from Large Language Models" paper by Carlini et al., 2020☆38Jul 10, 2022Updated 3 years ago
- Official implementation of Privacy Implications of Retrieval-Based Language Models (EMNLP 2023). https://arxiv.org/abs/2305.14888☆37Jun 10, 2024Updated last year
- Verbatim☆20Sep 22, 2025Updated 4 months ago
- [ACL 2023] Knowledge Unlearning for Mitigating Privacy Risks in Language Models☆86Sep 12, 2024Updated last year
- Code for the paper "Mehta, S. V., Patil, D., Chandar, S., & Strubell, E. (2023). An Empirical Investigation of the Role of Pre-training i…☆17Mar 18, 2024Updated last year
- ☆20Feb 11, 2024Updated 2 years ago
- Scripts for downloading and pre-processing the `proof-pile`, a high quality dataset of mathematical text and code.☆22Nov 26, 2022Updated 3 years ago
- ☆52May 2, 2021Updated 4 years ago
- ☆21Mar 17, 2025Updated 10 months ago
- ReCross: Unsupervised Cross-Task Generalization via Retrieval Augmentation☆24May 1, 2022Updated 3 years ago
- ☆27Dec 15, 2022Updated 3 years ago
- ANN Search through the COVID CORD-19 Dataset using SBERT.☆26May 9, 2020Updated 5 years ago
- TextHide: Tackling Data Privacy in Language Understanding Tasks☆31Apr 19, 2021Updated 4 years ago
- ☆35Jul 25, 2023Updated 2 years ago
- Firstpass generation with old SD model, Loras, embedding, etc☆32Jun 12, 2024Updated last year
- BERT models for many languages created from Wikipedia texts☆33May 25, 2020Updated 5 years ago
- M-ABSA: A Multilingual Dataset for Aspect-Based Sentiment Analysis☆13Nov 24, 2025Updated 2 months ago
- A virtual caregiver system that extracts the expression of mental and physical health states through dialogue-based human-computer intera…☆14Jan 29, 2023Updated 3 years ago
- Maintenance Information Extraction (MaintIE)☆16Jun 29, 2024Updated last year
- ☆12Updated this week
- flood fill a 2D map to create a Dijkstra map (distance map or field)☆12Sep 11, 2022Updated 3 years ago
- Code for Findings-EMNLP 2023 paper: Multi-step Jailbreaking Privacy Attacks on ChatGPT☆35Oct 15, 2023Updated 2 years ago
- ☆33Mar 13, 2025Updated 11 months ago
- EMNLP 2024: Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue☆38May 26, 2025Updated 8 months ago
- https://icml.cc/virtual/2023/poster/24354☆10Aug 15, 2023Updated 2 years ago