dessertlab / Targeted-Data-Poisoning-Attacks
This repository contains the code, the dataset and the experimental results related to the paper "Vulnerabilities in AI Code Generators: Exploring Targeted Data Poisoning Attacks" accepted for publication at The 32nd IEEE/ACM International Conference on Program Comprehension (ICPC 2024).
☆12Updated last year
Alternatives and similar repositories for Targeted-Data-Poisoning-Attacks
Users who are interested in Targeted-Data-Poisoning-Attacks are comparing it to the repositories listed below
- ☆15Updated 2 years ago
- Backdooring Neural Code Search☆14Updated 2 years ago
- Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"☆43Updated 3 years ago
- This is the official code repository for the paper "Exploiting the Adversarial Example Vulnerability of Transfer Learning of Source Code".☆16Updated 4 months ago
- ☆18Updated last year
- An open-source toolkit for textual backdoor attack and defense (NeurIPS 2022 D&B, Spotlight)☆200Updated 2 years ago
- ☆37Updated last year
- Training data extraction on GPT-2☆196Updated 3 years ago
- ☆15Updated last year
- Official Implementation of NeurIPS 2024 paper - BiScope: AI-generated Text Detection by Checking Memorization of Preceding Tokens☆28Updated 3 weeks ago
- ☆127Updated last year
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models☆233Updated 2 weeks ago
- Adversarial Attack for Pre-trained Code Models☆10Updated 3 years ago
- A collection of publications that work on code models but focus on more than accuracy.☆13Updated 2 years ago
- [NDSS 2025] "CLIBE: Detecting Dynamic Backdoors in Transformer-based NLP Models"☆24Updated 5 months ago
- Robust natural language watermarking using invariant features☆28Updated 2 years ago
- Official implementation of the EMNLP 2021 paper "ONION: A Simple and Effective Defense Against Textual Backdoor Attacks"☆36Updated 4 years ago
- ☆18Updated 3 years ago
- Replication Package for "Natural Attack for Pre-trained Models of Code", ICSE 2022☆51Updated 3 months ago
- Repo for SemStamp (NAACL2024) and k-SemStamp (ACL2024)☆27Updated last year
- Benchmarking Large Language Models' Resistance to Malicious Code☆14Updated last year
- [NeurIPS 2025] BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models☆273Updated last week
- MASTERKEY is a framework designed to explore and exploit vulnerabilities in large language model chatbots by automating jailbreak attacks…☆31Updated last year
- multi-bit language model watermarking (NAACL 24)☆17Updated last year
- ☆78Updated 3 years ago
- ☆86Updated 5 months ago
- ☆21Updated last year
- TrojanLM: Trojaning Language Models for Fun and Profit☆16Updated 4 years ago
- A lightweight library for large language model (LLM) jailbreaking defense.☆61Updated 5 months ago
- Composite Backdoor Attacks Against Large Language Models☆22Updated last year