dessertlab / Targeted-Data-Poisoning-Attacks
This repository contains the code, the dataset and the experimental results related to the paper "Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks" accepted for publication at The 32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024).
☆10 Updated 10 months ago
Alternatives and similar repositories for Targeted-Data-Poisoning-Attacks
Users who are interested in Targeted-Data-Poisoning-Attacks are comparing it to the repositories listed below.
- The code and data for "Are Large Pre-Trained Language Models Leaking Your Personal Information?" (Findings of EMNLP '22) ☆24 Updated 2 years ago
- [ICLR 2021] "Generating Adversarial Computer Programs using Optimized Obfuscations" by Shashank Srikant, Sijia Liu, Tamara Mitrovska, Shi… ☆30 Updated 3 years ago
- ☆24 Updated last year
- Code and data of the EMNLP 2021 paper "Mind the Style of Text! Adversarial and Backdoor Attacks Based on Text Style Transfer" ☆43 Updated 2 years ago
- Official implementation of "Data Mixture Inference: What do BPE tokenizers reveal about their training data?" ☆14 Updated last month
- Code for the AAAI 2023 paper "CodeAttack: Code-based Adversarial Attacks for Pre-Trained Programming Language Models" ☆31 Updated 2 years ago
- Implementation of the paper "Exploring the Universal Vulnerability of Prompt-based Learning Paradigm" on Findings of NAACL 2022 ☆29 Updated 2 years ago
- ☆57 Updated last year
- Code for Findings-ACL 2023 paper: Sentence Embedding Leaks More Information than You Expect: Generative Embedding Inversion Attack to Rec… ☆46 Updated last year
- ☆36 Updated 2 years ago
- ☆26 Updated last year
- ☆73 Updated 3 years ago
- Documenting large text datasets 🖼️ 📚 ☆12 Updated 6 months ago
- [EMNLP 2023] Poisoning Retrieval Corpora by Injecting Adversarial Passages https://arxiv.org/abs/2310.19156 ☆33 Updated last year
- ☆19 Updated last year
- Code for the paper "BadPrompt: Backdoor Attacks on Continuous Prompts" ☆36 Updated 11 months ago
- ☆6 Updated 2 years ago
- The repository contains the code for analysing the leakage of personally identifiable (PII) information from the output of next word pred… ☆97 Updated 10 months ago
- Proof of concept code for poisoning code generation models. ☆48 Updated last year
- In-context Example Selection with Influences ☆15 Updated 2 years ago
- TextHide: Tackling Data Privacy in Language Understanding Tasks ☆31 Updated 4 years ago
- Releasing code for "ReCode: Robustness Evaluation of Code Generation Models" ☆53 Updated last year
- ☆17 Updated 4 years ago
- Benchmarking MIAs against LLMs. ☆19 Updated 8 months ago
- Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021) ☆24 Updated 3 years ago
- ☆45 Updated last year
- ☆18 Updated 3 years ago
- Repository for "SecurityEval Dataset: Mining Vulnerability Examples to Evaluate Machine Learning-Based Code Generation Techniques" publis… ☆72 Updated last year
- CodexLeaks: Privacy Leaks from Code Generation Language Models in GitHub Copilot ☆11 Updated last year
- ☆24 Updated 2 years ago