chichidd/llm-lora-trojan

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/chichidd/llm-lora-trojan)

chichidd / llm-lora-trojan

Code for paper "The Philosopher’s Stone: Trojaning Plugins of Large Language Models"

☆27

Alternatives and similar repositories for llm-lora-trojan

Users that are interested in llm-lora-trojan are comparing it to the libraries listed below

Sorting:

lancopku / RAP
View on GitHub
Code for the paper "RAP: Robustness-Aware Perturbations for Defending against Backdoor Attacks on NLP Models" (EMNLP 2021)
☆25Oct 21, 2021Updated 4 years ago
Lyz1213 / Backdoored_PPLM
View on GitHub
☆15Dec 12, 2023Updated 2 years ago
casperllm / CASPER
View on GitHub
☆15Apr 27, 2024Updated last year
PurCL / ProSec
View on GitHub
Official repo for "ProSec: Fortifying Code LLMs with Proactive Security Alignment"
☆17Feb 26, 2026Updated last week
Zhang-Yihao / Adversarial-Representation-Engineering
View on GitHub
Official implementation repository for the paper Towards General Conceptual Model Editing via Adversarial Representation Engineering.
☆19Dec 6, 2024Updated last year
ZhangZhuoSJTU / LINT
View on GitHub
☆17Sep 4, 2024Updated last year
mbzuai-nlp / AudioJailbreak
View on GitHub
Audio Jailbreak: An Open Comprehensive Benchmark for Jailbreaking Large Audio-Language Models
☆30Oct 6, 2025Updated 5 months ago
fra31 / rlhf-trojan-competition-submission
View on GitHub
☆19Feb 25, 2024Updated 2 years ago
AI-secure / AdvAgent
View on GitHub
☆22May 28, 2025Updated 9 months ago
SolidShen / BAIT
View on GitHub
🔥🔥🔥 Detecting hidden backdoors in Large Language Models with only black-box access
☆52Jun 2, 2025Updated 9 months ago
ShenaoW / awesome-llm-supply-chain-security
View on GitHub
A curated list of awesome resources about LLM supply chain security (including papers, security reports and CVEs)
☆96Jan 20, 2025Updated last year
lancopku / agent-backdoor-attacks
View on GitHub
Code&Data for the paper "Watch Out for Your Agents! Investigating Backdoor Threats to LLM-Based Agents" [NeurIPS 2024]
☆109Sep 27, 2024Updated last year
lancopku / SOS
View on GitHub
Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)
☆24Dec 9, 2021Updated 4 years ago
conditionWang / Data_Centric_AI_IP_Protection
View on GitHub
This is the repository that introduces research topics related to protecting intellectual property (IP) of AI from a data-centric perspec…
☆23Oct 30, 2023Updated 2 years ago
zhangrui4041 / Instruction_Backdoor_Attack
View on GitHub
☆26Aug 21, 2024Updated last year
SchwinnL / LLM_Embedding_Attack
View on GitHub
Code to conduct an embedding attack on LLMs
☆31Jan 10, 2025Updated last year
sleeepeer / PoisonedRAG
View on GitHub
[USENIX Security 2025] PoisonedRAG: Knowledge Corruption Attacks to Retrieval-Augmented Generation of Large Language Models
☆238Jan 27, 2026Updated last month
zhaochenyang20 / Cyber-Security
View on GitHub
Course notes for Cyber Security (THUCST 2023 Spring)
☆31Jun 11, 2023Updated 2 years ago
seclab-yonsei / CovRL-Fuzz
View on GitHub
CovRL-Fuzz: Fuzzing JavaScript Interpreters with Coverage-Guided Reinforcement Learning for LLM-Based Mutation
☆41Nov 10, 2024Updated last year
OSU-NLP-Group / EIA_against_webagent
View on GitHub
☆37Oct 2, 2024Updated last year
WUSTL-CSPL / LLMJailbreak
View on GitHub
☆37Sep 30, 2024Updated last year
zzh816 / WKAFL-code
View on GitHub
☆11Feb 19, 2022Updated 4 years ago
PascalMat / 5G-AKA-simulation-in-C
View on GitHub
This is the implementation of the 5G-AKA for the master thesis: Identity management, identification mechanisms and privacy protection met…
☆11Jul 22, 2019Updated 6 years ago
LLMSmith / LLMSmith
View on GitHub
☆44Feb 26, 2025Updated last year
MrChenFeng / CLIPCleaner_ACMMM2024
View on GitHub
CLIPCleaner: Cleaning Noisy Labels with CLIP (ACM MM2024)
☆13Apr 28, 2025Updated 10 months ago
mint-vu / Brainwash
View on GitHub
BrainWash: A Poisoning Attack to Forget in Continual Learning
☆12Apr 15, 2024Updated last year
shaoshuo-ss / EaaW
View on GitHub
[NDSS 2025] Official code for our paper "Explanation as a Watermark: Towards Harmless and Multi-bit Model Ownership Verification via Wate…
☆45Nov 5, 2024Updated last year
TrustAIRLab / HateBench
View on GitHub
[USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
☆13Mar 1, 2025Updated last year
Kagami / docker_cve-2015-2925
View on GitHub
Docker + CVE-2015-2925 = escaping from --volume
☆11Jun 30, 2015Updated 10 years ago
dut-laowang / PCLMM
View on GitHub
The code implementation for the article "Towards Patronizing and Condescending Language in Chinese Videos: A Multimodal Dataset and Fram…
☆16Apr 3, 2025Updated 11 months ago
BUPT-GAMMA / CITGNN
View on GitHub
☆10Dec 26, 2023Updated 2 years ago
Shi-D / IMPapers
View on GitHub
Influence Maximization Paper List
☆11May 11, 2022Updated 3 years ago
adilahmad17 / Veil
View on GitHub
☆11Jun 10, 2024Updated last year
HITsz-TMG / Cognitive-Visual-Language-Mapper
View on GitHub
The codes and datasets about our ACL 2024 Main Conference paper titled "Cognitive Visual-Language Mapper: Advancing Multimodal Comprehens…
☆17Jan 24, 2025Updated last year
lynnegaogao / TransforLearn
View on GitHub
Interactive Visual Tutorial for the Transformer Model
☆13Sep 26, 2023Updated 2 years ago
fzwark / Secure_LLM_System
View on GitHub
☆14Mar 9, 2025Updated last year
Maryeon / whiten_mtd
View on GitHub
Official repository of paper "Let All be Whitened: Multi-teacher Distillation for Efficient Visual Retrieval"
☆10Dec 20, 2023Updated 2 years ago
EsYoon7 / RLHF-TLCR
View on GitHub
[ACL'24 Findings] Official code for "TLCR: Token-Level Continuous Reward for Fine-grained Reinforcement Learning from Human Feedback"
☆12Dec 6, 2024Updated last year
wgc-research / fgssl
View on GitHub
☆10Jul 18, 2023Updated 2 years ago