☆25Sep 3, 2025Updated 5 months ago
Alternatives and similar repositories for jailbreaking-frontier-models
Users that are interested in jailbreaking-frontier-models are comparing it to the libraries listed below
- Code for the API, workload execution, and agents underlying the LLMail-Inject Adaptive Prompt Injection Challenge☆19Feb 13, 2026Updated last week
- A better way of testing, inspecting, and analyzing AI Agent traces.☆48Jan 12, 2026Updated last month
- ☆18Mar 30, 2025Updated 10 months ago
- [ACL 2025] The official code for "AGrail: A Lifelong Agent Guardrail with Effective and Adaptive Safety Detection".☆32Aug 4, 2025Updated 6 months ago
- ☆21Jul 26, 2025Updated 7 months ago
- ☆27Oct 22, 2024Updated last year
- Is In-Context Learning Sufficient for Instruction Following in LLMs? [ICLR 2025]☆32Jan 23, 2025Updated last year
- ☆35Feb 20, 2025Updated last year
- Auditing agents for fine-tuning safety☆18Oct 21, 2025Updated 4 months ago
- AI Product Analyst — Claude Code-powered data analysis toolkit☆53Updated this week
- ☆35May 21, 2025Updated 9 months ago
- Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)☆10Feb 2, 2024Updated 2 years ago
- Let Claude control a web browser on your machine.☆43Jun 5, 2025Updated 8 months ago
- Test LLMs against jailbreaks and unprecedented harms☆40Oct 19, 2024Updated last year
- ODSC 2023 workshop materials on causal graphs using implementations of DoWhy (PyWhy, EconML)☆13Nov 1, 2023Updated 2 years ago
- ☆43Feb 9, 2026Updated 2 weeks ago
- Code for the paper "Distinguishing the Knowable from the Unknowable with Language Models"☆11Apr 15, 2024Updated last year
- A Multi-Session and Multi-Therapy Benchmark for High-Realism AI Psychological Counselor☆29Jan 13, 2026Updated last month
- Official implementation of the WASP web agent security benchmark☆70Aug 12, 2025Updated 6 months ago
- ☆34Nov 12, 2024Updated last year
- ☆14May 1, 2023Updated 2 years ago
- Entry for the first Hunan Province Artificial Intelligence Competition (2020)☆11Feb 17, 2022Updated 4 years ago
- Precision Knowledge Editing (PKE): A novel method to reduce toxicity in LLMs while preserving performance, with robust evaluations and ha…☆11Nov 26, 2024Updated last year
- An implementation of MSSRM method☆11Mar 23, 2023Updated 2 years ago
- ☆16Jan 16, 2025Updated last year
- ☆12Aug 29, 2020Updated 5 years ago
- The Pair App is employed by the Agency of Learning for team management and communication.☆10Apr 13, 2024Updated last year
- Automated Transition States Builder☆11Jun 1, 2023Updated 2 years ago
- Program uses cv2 to display many streams from cameras, web pages, local files☆14Jan 31, 2021Updated 5 years ago
- ☆83Updated this week
- YOLO object detection algorithm☆15Jul 27, 2025Updated 7 months ago
- [ICLR 2025] Dissecting adversarial robustness of multimodal language model agents☆124Feb 19, 2025Updated last year
- [ACL 2025] The official implementation of the paper "PIGuard: Prompt Injection Guardrail via Mitigating Overdefense for Free".☆59Dec 4, 2025Updated 2 months ago
- ACL24☆11Jun 7, 2024Updated last year
- ☆12Sep 19, 2025Updated 5 months ago
- ☆15Sep 16, 2025Updated 5 months ago
- Code for Rethinking Prompt Optimizers: From Prompt Merits to Optimization☆12Jan 12, 2026Updated last month
- playing with gpt4☆14Mar 17, 2023Updated 2 years ago
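- DETR tensor: removes the unused auxiliary heads from inference, speeds up deployment further with fp16, and offers a new fix for all-zero outputs after converting to tensorrt.☆12Jan 9, 2024Updated 2 years ago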