chichidd / llm-lora-trojanView external linksLinks
Code for paper "The Philosopher’s Stone: Trojaning Plugins of Large Language Models"
☆27Sep 11, 2024Updated last year
Alternatives and similar repositories for llm-lora-trojan
Users that are interested in llm-lora-trojan are comparing it to the libraries listed below
Sorting:
- Backdooring Neural Code Search☆14Sep 8, 2023Updated 2 years ago
- Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)☆25Oct 21, 2021Updated 4 years ago
- ☆15Dec 12, 2023Updated 2 years ago
- Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"☆17Mar 26, 2025Updated 10 months ago
- Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.☆18Dec 6, 2024Updated last year
- ☆17Sep 4, 2024Updated last year
- ☆19Feb 25, 2024Updated last year
- Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models☆28Oct 6, 2025Updated 4 months ago
- ☆22May 28, 2025Updated 8 months ago
- 🔥🔥🔥 Detecting hidden backdoors in Large Language Models with only black-box access☆52Jun 2, 2025Updated 8 months ago
- A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs)☆95Jan 20, 2025Updated last year
- Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]☆109Sep 27, 2024Updated last year
- Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)☆24Dec 9, 2021Updated 4 years ago
- This is the repository that introduces research topics related to protecting intellectual property (IP) of AI from a data-centric perspec…☆23Oct 30, 2023Updated 2 years ago
- ☆26Aug 21, 2024Updated last year
- Code for Voice Jailbreak Attacks Against GPT-4o.☆36May 31, 2024Updated last year
- [USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models☆233Jan 27, 2026Updated 2 weeks ago
- Course notes for Cyber Security (THUCST 2023 Spring)☆30Jun 11, 2023Updated 2 years ago
- CovRL-Fuzz: Fuzzing JavaScript Interpreters with Coverage-Guided Reinforcement Learning for LLM-Based Mutation☆40Nov 10, 2024Updated last year
- ☆37Oct 2, 2024Updated last year
- ☆11Feb 19, 2022Updated 3 years ago
- ☆37Sep 30, 2024Updated last year
- ☆44Feb 26, 2025Updated 11 months ago
- ☆12Dec 22, 2025Updated last month
- [USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns☆13Mar 1, 2025Updated 11 months ago
- BrainWash: A Poisoning Attack to Forget in Continual Learning☆12Apr 15, 2024Updated last year
- [NDSS 2025] Official code for our paper "Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Wate…☆45Nov 5, 2024Updated last year
- [Usenix Security 2024] Official code implementation of "BackdoorIndicator: Leveraging OOD Data for Proactive Backdoor Detection in Federa…☆49Sep 10, 2025Updated 5 months ago
- ☆10Jun 10, 2024Updated last year
- Docker + CVE-2015-2925 = escaping from --volume☆11Jun 30, 2015Updated 10 years ago
- study_summary☆10Aug 8, 2022Updated 3 years ago
- The code implementation of MuScleLoRA (Accepted in ACL 2024)☆10Dec 1, 2024Updated last year
- Influence Maximization Paper List☆11May 11, 2022Updated 3 years ago
- The code implementation of GraCeFul (Accepted in COLING 2025)☆13Jan 27, 2025Updated last year
- Open source RAN UE centric security testing software.☆14Nov 20, 2025Updated 2 months ago
- Low-level HTTP/2 client implementation for experimenting with the protocol.☆11Jul 26, 2020Updated 5 years ago
- Official PyTorch implementation of "MM-PoisonRAG: Disrupting Multimodal RAG with Local and Global Poisoning Attacks"☆12Dec 4, 2025Updated 2 months ago
- The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…☆15Apr 3, 2025Updated 10 months ago
- ☆10Dec 26, 2023Updated 2 years ago