HKU-TASR / Imperio

[IJCAI 2024] Imperio is an LLM-powered backdoor attack. It allows the adversary to issue language-guided instructions to control the victim model's prediction for arbitrary targets.
41Updated 7 months ago

Related projects

Alternatives and complementary repositories for Imperio