Helloworld10011 / Adversarial-ReasoningView external linksLinks
A new algorithm that formulates jailbreaking as a reasoning problem.
☆26Jul 2, 2025Updated 7 months ago
Alternatives and similar repositories for Adversarial-Reasoning
Users that are interested in Adversarial-Reasoning are comparing it to the libraries listed below
Sorting:
- Code and data to go with the Zhu et al. paper "An Objective for Nuanced LLM Jailbreaks"☆36Dec 18, 2024Updated last year
- ☆47Feb 4, 2026Updated last week
- All in How You Ask for It: Simple Black-Box Method for Jailbreak Attacks☆18Apr 24, 2024Updated last year
- [ICCVW 2025 (Oral)] Robust-LLaVA: On the Effectiveness of Large-Scale Robust Image Encoders for Multi-modal Large Language Models☆28Oct 20, 2025Updated 3 months ago
- [ICLR 2025] Official Repository for "Tamper-Resistant Safeguards for Open-Weight LLMs"☆66Jun 9, 2025Updated 8 months ago
- Code for ICLR 2025 Failures to Find Transferable Image Jailbreaks Between Vision-Language Models☆37Jun 1, 2025Updated 8 months ago
- Official implementation of paper: DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers☆65Aug 25, 2024Updated last year
- Fine-tuning base models to build robust task-specific models☆34Apr 11, 2024Updated last year
- ☆39May 17, 2025Updated 8 months ago
- Irolyn is a jailbreak repo extractor for iOS 18 to iOS 18.5 and iPadOS 18 to iPadOS 18.5 .☆12May 15, 2025Updated 9 months ago
- ☆33Jun 24, 2024Updated last year
- [COLM 2025] SEAL: Steerable Reasoning Calibration of Large Language Models for Free☆52Apr 6, 2025Updated 10 months ago
- ☆37Oct 2, 2024Updated last year
- Awesome Large Reasoning Model(LRM) Safety.This repository is used to collect security-related research on large reasoning models such as …☆81Feb 6, 2026Updated last week
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 4 months ago
- Chain of Attack: a Semantic-Driven Contextual Multi-Turn attacker for LLM☆39Jan 17, 2025Updated last year
- ☆12May 6, 2022Updated 3 years ago
- ☆55May 21, 2025Updated 8 months ago
- "An Introduction to Time Series Analysis with R" is a text which is currently under development and aims at giving readers a general ove…☆10Oct 1, 2021Updated 4 years ago
- Cowabunga Online is a new tool for jailbreaking iOS 18 to iOS 18.5 devices. Enjoy easy access to online jailbreak features!☆11May 15, 2025Updated 9 months ago
- iOS 17.2 Jailbreak and Jailbreak guides and Download links☆10Nov 1, 2023Updated 2 years ago
- Config files for my GitHub profile.☆38Dec 20, 2023Updated 2 years ago
- A jailbreak tweak to respring your device using the hardware buttons☆11Jun 9, 2020Updated 5 years ago
- RAB: Provable Robustness Against Backdoor Attacks☆39Oct 3, 2023Updated 2 years ago
- This is developer library for better preferences and useful utilities - iOS jailbreak.☆12May 10, 2021Updated 4 years ago
- Official implementation of Tabular Transfer Learning via Prompting LLMs (COLM 2024).☆12Aug 6, 2024Updated last year
- Deployed contracts, helper contract, js and ts bindings☆13Mar 22, 2021Updated 4 years ago
- Create your own Wallpapers.☆13Apr 24, 2020Updated 5 years ago
- Ios 11-11.1.2 Jailbreak And ios 10-10.3.3 jailbreak. ORIGINAL PROJECT: https://github.com/JosephShenton/C0F3☆13Feb 8, 2018Updated 8 years ago
- ☆20Feb 3, 2025Updated last year
- [DEPRECATED] The missing Blacklist app for your iOS 5/6 with private APIs. No Jailbreak Required!☆148Sep 23, 2013Updated 12 years ago
- Unveiling and Mitigating Bias in Mental Health Analysis with Large Language Models☆12Jun 21, 2024Updated last year
- Corresponding code to "Improving Robustness of ML Classifiers against Realizable Evasion Attacks Using Conserved Features" @ USENIX Secur…☆11Aug 5, 2019Updated 6 years ago
- ☆12Apr 25, 2025Updated 9 months ago
- ☆10Mar 22, 2019Updated 6 years ago
- Test equality between a black-box LLM API and a reference distribution☆12Oct 29, 2024Updated last year
- Reproduction of "Latent Weights Do Not Exist: Rethinking Binarized Neural Network Optimization" for the Reproducibility challenge@NeurIPS…☆11Jan 14, 2020Updated 6 years ago
- A simple JSON parser in Objective-C☆111Oct 7, 2009Updated 16 years ago
- Code for Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks (TIFS2024)☆13Mar 29, 2024Updated last year