Official implementation of “Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models” (AAAI 2026).
☆33Dec 17, 2025Updated 3 months ago
Alternatives and similar repositories for Response-Attack
Users that are interested in Response-Attack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of Visco-Attack (EMNLP 2025 Main). We will progressively release the code and one-click reproduction scripts.☆30Aug 22, 2025Updated 7 months ago
- Research on "Many-Shot Jailbreaking" in Large Language Models (LLMs). It unveils a novel technique capable of bypassing the safety mechan…☆16Aug 6, 2024Updated last year
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…☆13Oct 27, 2024Updated last year
- Instant Graph Neural Networks for Dynamic Graphs☆11Dec 28, 2022Updated 3 years ago
- Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"☆11Aug 15, 2024Updated last year
- Multi-step reasoning MLLM☆16Mar 8, 2026Updated 2 weeks ago
- ☆16Sep 1, 2025Updated 6 months ago
- DataBaseLab,XJTU 西交数据库实验☆11Jun 25, 2024Updated last year
- ☆38Jun 14, 2025Updated 9 months ago
- Panda Guard is designed for researching jailbreak attacks, defenses, and evaluation algorithms for large language models (LLMs).☆66Jan 19, 2026Updated 2 months ago
- Graph Coarsening with Neural Networks☆11Mar 3, 2022Updated 4 years ago
- Diagnostic Framework for LLMs and MLLMs☆35Mar 2, 2026Updated 3 weeks ago
- A lifecycle guard skill.☆73Updated this week
- zotero-pdf2zh 的 Homebrew 安装脚本,让你可以轻松在本地部署 Zotero PDF 翻译服务器☆44Mar 5, 2026Updated 2 weeks ago
- ☆24May 23, 2025Updated 10 months ago
- 🧨 TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?☆74Nov 27, 2025Updated 3 months ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion☆59Oct 1, 2025Updated 5 months ago
- XJTU OS Lab☆15Dec 3, 2022Updated 3 years ago
- ☆18Apr 7, 2025Updated 11 months ago
- ☆16Mar 22, 2025Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆14Jun 21, 2024Updated last year
- ☆11Jan 19, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- ☆11Sep 10, 2024Updated last year
- Tasks for describing differences between text distributions.☆17Aug 9, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- Motif-aware Riemannian Graph Neural Network with Generative-Contrastive Learning☆19Apr 15, 2024Updated last year
- ☆20Nov 15, 2024Updated last year
- [ICLR 2026] "When AI Agents Collude Online: Financial Fraud Risks by Collaborative LLM Agents on Social Platforms"☆33Feb 3, 2026Updated last month
- ☆30May 22, 2024Updated last year
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.☆21Feb 4, 2026Updated last month
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models …☆15Sep 12, 2025Updated 6 months ago
- ☆49Nov 9, 2025Updated 4 months ago
- ☆18Oct 20, 2024Updated last year
- "GraphArena: Evaluating and Exploring Large Language Models on Graph Computation" in ICLR 2025☆32Mar 2, 2025Updated last year
- Implementation codes for KDD24 paper "LLM4DyG: Can Large Language Models Solve Spatial-Temporal Problems on Dynamic Graphs?"☆33Sep 10, 2024Updated last year
- implementation of paper "Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners"☆20Aug 17, 2023Updated 2 years ago