brian-lou / Training-Data-Extraction-Attack-on-LLMs

This project explores training data extraction attacks on the LLaMa 7B, GPT-2XL, and GPT-2-IMDB models to discover memorized content using perplexity, perturbation scoring metrics, and large scale search queries.
12Updated last year

Alternatives and similar repositories for Training-Data-Extraction-Attack-on-LLMs:

Users that are interested in Training-Data-Extraction-Attack-on-LLMs are comparing it to the libraries listed below