A new algorithm that formulates jailbreaking as a reasoning problem.
☆26Jul 2, 2025Updated 8 months ago
Alternatives and similar repositories for Adversarial-Reasoning
Users who are interested in Adversarial-Reasoning are comparing it to the libraries listed below
- Code and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"☆36Dec 18, 2024Updated last year
- ☆48Feb 25, 2026Updated last week
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Apr 24, 2024Updated last year
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models☆28Oct 20, 2025Updated 4 months ago
- [ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"☆67Jun 9, 2025Updated 8 months ago
- Code for the ICLR 2025 paper "Failures to Find Transferable Image Jailbreaks Between Vision-Language Models"☆37Jun 1, 2025Updated 9 months ago
- Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers☆66Aug 25, 2024Updated last year
- Fine-tuning base models to build robust task-specific models☆34Apr 11, 2024Updated last year
- Irolyn is a jailbreak repo extractor for iOS 18 to iOS 18.5 and iPadOS 18 to iPadOS 18.5.☆12May 15, 2025Updated 9 months ago
- ☆40May 17, 2025Updated 9 months ago
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆54Apr 6, 2025Updated 11 months ago
- ☆33Jun 24, 2024Updated last year
- ☆37Oct 2, 2024Updated last year
- Awesome Large Reasoning Model (LRM) Safety. This repository is used to collect security-related research on large reasoning models such as …☆82Updated this week
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 5 months ago
- ☆12May 6, 2022Updated 3 years ago
- Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM☆39Jan 17, 2025Updated last year
- ☆56May 21, 2025Updated 9 months ago
- A jailbreak tweak to respring your device using the hardware buttons☆11Jun 9, 2020Updated 5 years ago
- Config files for my GitHub profile.☆38Dec 20, 2023Updated 2 years ago
- ☆11Feb 28, 2025Updated last year
- "An Introduction to Time Series Analysis with R" is a text which is currently under development and aims at giving readers a general ove…☆10Oct 1, 2021Updated 4 years ago
- Cowabunga Online is a new tool for jailbreaking iOS 18 to iOS 18.5 devices. Enjoy easy access to online jailbreak features!☆11May 15, 2025Updated 9 months ago
- The iOS 11 dock on iOS 10.☆10Jan 8, 2018Updated 8 years ago
- iOS 17.2 Jailbreak and Jailbreak guides and Download links☆10Nov 1, 2023Updated 2 years ago
- RAB: Provable Robustness Against Backdoor Attacks☆39Oct 3, 2023Updated 2 years ago
- Repository dedicated to jailbreak tweaks and other related projects.☆11Nov 18, 2020Updated 5 years ago
- Corresponding code to "FACESEC: A Fine-grained Robustness Evaluation Framework for Face Recognition Systems" @ CVPR 2021☆13Jun 22, 2021Updated 4 years ago
- Code for "Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks" (TIFS 2024)☆13Mar 29, 2024Updated last year
- ☆11Oct 13, 2022Updated 3 years ago
- This is a developer library for better preferences and useful utilities - iOS jailbreak.☆12May 10, 2021Updated 4 years ago
- An iOS tweak that neutralises jailbreaking detection as well as other anti-debugging mechanisms☆10Dec 2, 2012Updated 13 years ago
- Create your own Wallpapers.☆13Apr 24, 2020Updated 5 years ago
- ☆14Sep 21, 2024Updated last year
- A simple JSON parser in Objective-C☆111Oct 7, 2009Updated 16 years ago
- ☆10May 4, 2024Updated last year
- To mitigate position bias in LLMs, especially in long-context scenarios, we scale only one dimension of LLMs, reducing position bias and …☆11Jun 18, 2024Updated last year
- Stop! Don't panic, you can still fix this...☆11Aug 3, 2021Updated 4 years ago
- ☆12Apr 25, 2025Updated 10 months ago