eth-sri / llm-quantization-attack
☆10Updated 4 months ago
Related projects ⓘ
Alternatives and complementary repositories for llm-quantization-attack
- ☆35Updated 9 months ago
- Python package for measuring memorization in LLMs.☆119Updated last month
- ☆17Updated 3 weeks ago
- The official implementation of our pre-print paper "Automatic and Universal Prompt Injection Attacks against Large Language Models".☆32Updated 3 weeks ago
- ☆20Updated 9 months ago
- ☆32Updated last week
- A toolkit to assess data privacy in LLMs (under development)☆41Updated last month
- A curated list of trustworthy Generative AI papers. Daily updating...☆67Updated 2 months ago
- ☆66Updated last year
- ☆25Updated 5 months ago
- ☆22Updated last year
- Repository for Towards Codable Watermarking for Large Language Models☆29Updated last year
- An unofficial implementation of AutoDAN attack on LLMs (arXiv:2310.15140)☆29Updated 9 months ago
- ☆45Updated 5 months ago
- ☆9Updated 2 months ago
- This repository is the official implementation of the paper "ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning…☆17Updated last year
- Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]☆44Updated last month
- Code to generate NeuralExecs (prompt injection for LLMs)☆16Updated 3 months ago
- A collection of automated evaluators for assessing jailbreak attempts.☆72Updated 4 months ago
- BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks on Large Language Models☆72Updated 2 months ago
- ☆12Updated 6 months ago
- Official Repo of ICLR 24 BadChain: Backdoor Chain-of-Thought Prompting for Large Language Models☆16Updated 3 months ago
- Official Code for "Baseline Defenses for Adversarial Attacks Against Aligned Language Models"☆20Updated last year
- This is the starter kit for the Trojan Detection Challenge 2023 (LLM Edition), a NeurIPS 2023 competition.☆78Updated 5 months ago
- [CIKM 2024] Trojan Activation Attack: Attack Large Language Models using Activation Steering for Safety-Alignment.☆16Updated 3 months ago
- ☆13Updated 2 years ago
- Official Repository for ACL 2024 Paper SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding☆97Updated 3 months ago
- ☆14Updated last month
- [IEEE S&P'24] ODSCAN: Backdoor Scanning for Object Detection Models☆11Updated 5 months ago
- Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"☆14Updated 4 months ago