hychaochao/Chat-Models-Backdoor-Attacking

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/hychaochao/Chat-Models-Backdoor-Attacking)

hychaochao / Chat-Models-Backdoor-Attacking

Code for the paper "Exploring Backdoor Vulnerabilities of Chat Models"

☆19

Alternatives and similar repositories for Chat-Models-Backdoor-Attacking

Users that are interested in Chat-Models-Backdoor-Attacking are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

NJU-RL / DIVER
View on GitHub
[ICLR 2026] The Official Implementation of DIVER
☆34Mar 5, 2026Updated 4 months ago
lancopku / SOS
View on GitHub
Code for the paper "Rethinking Stealthiness of Backdoor Attack against NLP Models" (ACL-IJCNLP 2021)
☆24Dec 9, 2021Updated 4 years ago
STARE-bench / STARE
View on GitHub
☆19Oct 12, 2025Updated 9 months ago
VisualSphinx / VisualSphinx
View on GitHub
☆17Jun 3, 2025Updated last year
wegodev2 / virtual-prompt-injection
View on GitHub
Unofficial implementation of "Backdooring Instruction-Tuned Large Language Models with Virtual Prompt Injection"
☆27Jul 6, 2024Updated 2 years ago
Managed hosting for WordPress and PHP on Cloudways • Ad
Managed hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
ssmisya / VLMLT
View on GitHub
[CVPR' 25] Official repo for From Head to Tail: Towards Balanced Representation in Large Vision-Language Models through Adaptive Data Cal…
☆22Jun 6, 2025Updated last year
Linzwcs / AutoMusicTheoryQA
View on GitHub
☆22Nov 21, 2025Updated 8 months ago
zzzhr97 / SpecBench
View on GitHub
Code repository for the ICML 2026 paper "Reasoning over Boundaries: Enhancing Specification Alignment via Test-time Deliberation".
☆24Jun 14, 2026Updated last month
Yummytanmo / SubtleMemory
View on GitHub
A Benchmark for Fine-Grained Relational Memory Discrimination in Long-Horizon AI Agents
☆20Jun 9, 2026Updated last month
Jielin-Qiu / MMWatermark-Robustness
View on GitHub
Evaluating Durability: Benchmark Insights into Multimodal Watermarking
☆12Jun 7, 2024Updated 2 years ago
EMMA-Bench / EMMA
View on GitHub
[ICML 2025 Oral] The official repository for the paper "Can MLLMs Reason in Multimodality? EMMA: An Enhanced MultiModal ReAsoning Benchma…
☆69Jul 17, 2025Updated last year
grasses / RemovalNet
View on GitHub
Code for paper: "RemovalNet: DNN model fingerprinting removal attack", IEEE TDSC 2023.
☆10Nov 27, 2023Updated 2 years ago
thunlp / HiddenKiller
View on GitHub
Code and data of the ACL-IJCNLP 2021 paper "Hidden Killer: Invisible Textual Backdoor Attacks with Syntactic Trigger"
☆45Sep 11, 2022Updated 3 years ago
ssmisya / AdaReasoner
View on GitHub
[ICLR 2026] The official repository for the paper "AdaReasoner: Dynamic Tool Orchestration for Iterative Visual Reasoning".
☆83Feb 27, 2026Updated 5 months ago
Bare Metal GPUs on DigitalOcean Gradient AI • Ad
Purpose-built for serious AI teams training foundational models, running large-scale inference, and pushing the boundaries of what's possible.
zhaojunGUO / Awesome-LLM-Watermark
View on GitHub
Watermarking LLM papers up-to-date
☆12Dec 17, 2023Updated 2 years ago
thu-coai / CDConv
View on GitHub
Data and codes for EMNLP 2022 paper "CDConv: A Benchmark for Contradiction Detection in Chinese Conversations"
☆13May 8, 2023Updated 3 years ago
mengxiangming / QCS-SGM
View on GitHub
☆15Jan 16, 2024Updated 2 years ago
UCSB-AI / SafeKey
View on GitHub
[EMNLP 2025] Official code for the paper "SafeKey: Amplifying Aha-Moment Insights for Safety Reasoning"
☆16May 12, 2026Updated 2 months ago
danieldeutsch / qaeval
View on GitHub
☆15Aug 3, 2021Updated 4 years ago
Yongqi-Zhuo / triton-tvm
View on GitHub
Triton to TVM transpiler.
☆24Oct 14, 2024Updated last year
qabin / kb-vue-components
View on GitHub
前端组件库
☆24Mar 3, 2023Updated 3 years ago
davidgassner / AndroidDev2019Data
View on GitHub
Exercise files for the course Android Development Essential Training: Manage Data on LinkedIn Learning
☆24Aug 1, 2019Updated 6 years ago
SalesforceAIResearch / LATTE
View on GitHub
☆70Jun 2, 2026Updated last month
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
sunbelbd / invisible_backdoor_attacks
View on GitHub
☆19Mar 26, 2022Updated 4 years ago
guanjiyang / SAC
View on GitHub
☆18Oct 7, 2022Updated 3 years ago
ZOOT-Plus / zoot-plus-store
View on GitHub
ZOOT Plus 数据定期备份
☆18Jul 11, 2026Updated 2 weeks ago
hkust-nlp / mstar
View on GitHub
[ICML 2025] M-STAR (Multimodal Self-Evolving TrAining for Reasoning) Project. Diving into Self-Evolving Training for Multimodal Reasoning
☆75Jul 13, 2025Updated last year
Coxy7 / robust-finetuning
View on GitHub
[CVPR 2023] Code for the paper "Masked Images Are Counterfactual Samples for Robust Fine-tuning"
☆14Mar 24, 2023Updated 3 years ago
1362860831 / -LSB-
View on GitHub
东南大学网络空间安全学院-信息隐藏与数字水印课程实验
☆19Dec 1, 2020Updated 5 years ago
OpenSparseLLMs / Linearization
View on GitHub
☆71Jul 8, 2025Updated last year
yuanchun-li / ModelDiff
View on GitHub
☆17Jun 27, 2021Updated 5 years ago
zjiehang / RanMASK
View on GitHub
For Certified Robustness to Text Adversarial Attacks by Randomized [MASK]
☆17Oct 8, 2024Updated last year
Deploy open-source AI quickly and easily - Special Bonus Offer • Ad
Runpod Hub is built for open source. One-click deployment and autoscaling endpoints without provisioning your own infrastructure.
Lyz1213 / Backdoored_PPLM
View on GitHub
☆15Dec 12, 2023Updated 2 years ago
OpenSparseLLMs / Skip-DiT
View on GitHub
✈️ [ICCV 2025] Towards Stabilized and Efficient Diffusion Transformers through Long-Skip-Connections with Spectral Constraints
☆80Jul 10, 2025Updated last year
kunkun0w0 / SGA
View on GitHub
The official code of "Eliminating Gradient Conflict in Reference-based Line-art Colorization" (ECCV2022)
☆39Dec 12, 2022Updated 3 years ago
hkust-nlp / Laser
View on GitHub
[ICLR2026] Laser: Learn to Reason Efficiently with Adaptive Length-based Reward Shaping
☆66May 22, 2025Updated last year
Linzwcs / echos
View on GitHub
Echos is a headless, API-driven DAW engine. It’s the backend for building AI tools that automate the entire music production lifecycle.
☆55Nov 10, 2025Updated 8 months ago
TReg-inverse / TReg
View on GitHub
Official repository for "Regularization by Texts for Latent Diffusion Inverse Solvers" (ICLR2025 spotlight)
☆18Mar 17, 2025Updated last year
bangawayoo / mb-lm-watermarking
View on GitHub
multi-bit language model watermarking (NAACL 24)
☆19Sep 20, 2024Updated last year