Official implementation of “Response Attack: Exploiting Contextual Priming to Jailbreak Large Language Models” (AAAI 2026).
☆37Mar 22, 2026Updated last month
Alternatives and similar repositories for Response-Attack
Users that are interested in Response-Attack are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Official implementation of Visco-Attack (EMNLP 2025 Main). An open-source one-click reproduction script is also provided.☆30Apr 11, 2026Updated 3 weeks ago
- Universal preflight security scanner for AI coding agents — Detects hooks injection, credential exfiltration & backdoors in .cursorrules,…☆69Apr 9, 2026Updated 3 weeks ago
- Official Implementation of "Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts" at EMNLP 202…☆13Oct 27, 2024Updated last year
- Instant Graph Neural Networks for Dynamic Graphs☆11Dec 28, 2022Updated 3 years ago
- Codes and data for KDD 2024 Research Track paper "ProCom: A Few-shot Targeted Community Detection Algorithm"☆11Aug 15, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Panda Guard is designed for researching jailbreak attacks, defenses, and evaluation algorithms for large language models (LLMs).☆67Mar 23, 2026Updated last month
- Graph Coarsening with Neural Networks☆11Mar 3, 2022Updated 4 years ago
- Multi-step reasoning MLLM☆22Mar 8, 2026Updated last month
- Diagnostic Framework for LLMs and MLLMs☆36Mar 2, 2026Updated 2 months ago
- The code of Dynamic Graph Learning Based on Hierarchical Memory for Origin-Destination Demand Prediction☆14Apr 29, 2022Updated 4 years ago
- [CVPR2025] T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation☆33Jul 10, 2025Updated 9 months ago
- ☆24May 23, 2025Updated 11 months ago
- 🧨 TradeTrap: Are LLM-based Trading Agents Truly Reliable and Faithful?☆74Nov 27, 2025Updated 5 months ago
- From Accuracy to Robustness: A Study of Rule- and Model-based Verifiers in Mathematical Reasoning.☆25Oct 7, 2025Updated 6 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- [ACL 2024] CodeAttack: Revealing Safety Generalization Challenges of Large Language Models via Code Completion☆59Oct 1, 2025Updated 7 months ago
- XJTU OS Lab☆15Dec 3, 2022Updated 3 years ago
- ☆18Apr 7, 2025Updated last year
- ☆16Mar 22, 2025Updated last year
- [ICLR 2025] Code&Data for the paper "Super(ficial)-alignment: Strong Models May Deceive Weak Models in Weak-to-Strong Generalization"☆15Jun 21, 2024Updated last year
- ☆11Jan 19, 2025Updated last year
- A toolkit for testing and improving named entity recognition [ESEC/FSE'23]☆11Aug 31, 2023Updated 2 years ago
- ☆11Sep 10, 2024Updated last year
- ☆13Sep 12, 2024Updated last year
- Deploy on Railway without the complexity - Free Credits Offer • AdConnect your repo and Railway handles the rest with instant previews. Quickly provision container image services, databases, and storage volumes.
- Motif-aware Riemannian Graph Neural Network with Generative-Contrastive Learning☆19Apr 15, 2024Updated 2 years ago
- Jailbreak Evo☆22Jun 2, 2025Updated 11 months ago
- ☆30May 22, 2024Updated last year
- The implementation for our paper, "Improving Simultaneous Machine Translation with Monolingual Data," accepted to AAAI 2023. 🎉☆12Jul 19, 2023Updated 2 years ago
- [ICDE'2024] "GraphAug: Graph Augmentation for Recommendation"☆24Sep 17, 2024Updated last year
- Welcome to the official repository for Siren, a project aimed at understanding and mitigating harmful behaviors in large language models …☆15Sep 12, 2025Updated 7 months ago
- ☆50Nov 9, 2025Updated 5 months ago
- ☆18Oct 20, 2024Updated last year
- Official codes for GRA (Accepted by ICCV2023)☆17Jul 18, 2023Updated 2 years ago
- Wordpress hosting with auto-scaling - Free Trial Offer • AdFully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
- Implementation codes for KDD24 paper "LLM4DyG: Can Large Language Models Solve Spatial-Temporal Problems on Dynamic Graphs?"☆34Sep 10, 2024Updated last year
- [ICLR 2026] "When AI Agents Collude Online: Financial Fraud Risks by Collaborative LLM Agents on Social Platforms"☆39Feb 3, 2026Updated 3 months ago
- "GraphArena: Evaluating and Exploring Large Language Models on Graph Computation" in ICLR 2025☆34Mar 2, 2025Updated last year
- ☆19Oct 2, 2023Updated 2 years ago
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated last year
- ☆17Feb 22, 2024Updated 2 years ago
- [ICSE'25] Aligning the Objective of LLM-based Program Repair☆23Mar 8, 2025Updated last year