Official implementation of “Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models” (AAAI 2026).
☆37Mar 22, 2026Updated 2 months ago
Alternatives and similar repositories for Response-Attack
Users that are interested in Response-Attack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of Visco-Attack (EMNLP 2025 Main). An open-source one-click reproduction script is also provided.☆30Apr 11, 2026Updated 2 months ago
- Universal preflight security scanner for AI coding agents — Detects hooks injection, credential exfiltration & backdoors in .cursorrules,…☆72May 29, 2026Updated 2 weeks ago
- Instant Graph Neural Networks for Dynamic Graphs☆11Dec 28, 2022Updated 3 years ago
- Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"☆11Aug 15, 2024Updated last year
- ☆16Sep 1, 2025Updated 9 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- DataBaseLab,XJTU 西交数据库实验☆13Jun 25, 2024Updated last year
- Panda Guard is designed for researching jailbreak attacks, defenses, and evaluation algorithms for large language models (LLMs).☆68Mar 23, 2026Updated 2 months ago
- Graph Coarsening with Neural Networks☆11Mar 3, 2022Updated 4 years ago
- Multi-step reasoning MLLM☆24Mar 8, 2026Updated 3 months ago
- Diagnostic Framework for LLMs and MLLMs☆38Mar 2, 2026Updated 3 months ago
- [CVPR2025] T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation☆33Jul 10, 2025Updated 11 months ago
- ☆24May 23, 2025Updated last year
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- 🧨 TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?☆79Nov 27, 2025Updated 6 months ago
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- [WSDM 2026] LookAhead Tuning: Safer Language Models via Partial Answer Previews☆17Dec 14, 2025Updated 6 months ago
- XJTU OS Lab☆15Dec 3, 2022Updated 3 years ago
- ☆19Apr 7, 2025Updated last year
- ☆16Mar 22, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- ☆11Sep 10, 2024Updated last year
- Jailbreak Evo☆22Jun 2, 2025Updated last year
- Motif-aware Riemannian Graph Neural Network with Generative-Contrastive Learning☆19Apr 15, 2024Updated 2 years ago
- Code for "Knowledge Card: Filling LLMs' Knowledge Gaps with Plug-in Specialized Language Models", ICLR 2024 Oral.☆21Feb 4, 2026Updated 4 months ago
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- ☆30May 22, 2024Updated 2 years ago
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models …☆15Sep 12, 2025Updated 9 months ago
- ☆50Nov 9, 2025Updated 7 months ago
- implementation of paper "Large Language Models are In-Context Semantic Reasoners rather than Symbolic Reasoners"☆20Aug 17, 2023Updated 2 years ago
- Implementation codes for KDD24 paper "LLM4DyG: Can Large Language Models Solve Spatial-Temporal Problems on Dynamic Graphs?"☆34Sep 10, 2024Updated last year
- ☆19Oct 2, 2023Updated 2 years ago
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated last year
- ☆17Feb 22, 2024Updated 2 years ago
- Virtual machines for every use case on DigitalOcean • AdGet dependable uptime with 99.99% SLA, simple security tools, and predictable monthly pricing with DigitalOcean's virtual machines, called Droplets.
- [ICSE'25] Aligning the Objective of LLM-based Program Repair☆24Mar 8, 2025Updated last year
- ☆19Mar 25, 2024Updated 2 years ago
- PiD: Fast and High-Resolution Latent Decoding with Pixel Diffusion☆716Jun 3, 2026Updated last week
- A lifecycle guard skill.☆180Mar 27, 2026Updated 2 months ago
- ☆27Mar 17, 2025Updated last year
- ☆18Dec 12, 2025Updated 6 months ago
- ☆15Aug 1, 2023Updated 2 years ago