theshi-1128/ReDPJ

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/theshi-1128/ReDPJ)

theshi-1128 / ReDPJ

A novel jailbreak attack unveiling an overlooked attack surface inherently in the chain-of-thought reasoning trajectory of LLMs

☆22

Alternatives and similar repositories for ReDPJ

Users that are interested in ReDPJ are comparing it to the libraries listed below

Sorting:

theshi-1128 / llm-defense
View on GitHub
An easy-to-use Python framework to defend against jailbreak prompts.
☆21Mar 22, 2025Updated 11 months ago
Xuzhenhua55 / awesome-llm-copyright-protection
View on GitHub
A curated collection of research and techniques for protecting intellectual property of large language models, including watermarking, fi…
☆46Feb 15, 2026Updated 3 weeks ago
hac425xxx / trapfuzzer-gdb
View on GitHub
A gdb for fuzzing
☆22Nov 26, 2021Updated 4 years ago
LLMSecurity / MasterKey
View on GitHub
MASTERKEY is a framework designed to explore and exploit vulnerabilities in large language model chatbots by automating jailbreak attacks…
☆33Sep 12, 2024Updated last year
NJUNLP / ReNeLLM
View on GitHub
The official implementation of our NAACL 2024 paper "A Wolf in Sheep’s Clothing: Generalized Nested Jailbreak Prompts can Fool Large Lang…
☆153Sep 2, 2025Updated 6 months ago
walledai / walledeval
View on GitHub
Test LLMs against jailbreaks and unprecedented harms
☆40Oct 19, 2024Updated last year
Allen-piexl / JailbreakZoo
View on GitHub
☆164Sep 2, 2024Updated last year
TrustAIRLab / HateBench
View on GitHub
[USENIX'25] HateBench: Benchmarking Hate Speech Detectors on LLM-Generated Content and Hate Campaigns
☆13Mar 1, 2025Updated last year
WebGoat / WebWolf
View on GitHub
☆11Jul 10, 2024Updated last year
mike-goodwin / owasp-threat-dragon-core
View on GitHub
OWASP Threat Dragon core files
☆11Jan 26, 2026Updated last month
aaronmccall / cvss
View on GitHub
A CommonJS library for working with Common Vulnerability Scoring System vectors and scores.
☆12Jul 14, 2022Updated 3 years ago
yetingli / PoCs
View on GitHub
A list of CVE's with Proof of Concepts
☆11Jun 17, 2021Updated 4 years ago
ackul / CrimeCheck
View on GitHub
The CRIME and BREACH Attacks work against SSL and HTTP Compression. They leverage specific properties of used compression functions and c…
☆17Dec 29, 2013Updated 12 years ago
pete911 / zip-bomb
View on GitHub
scripts to create zip bombs
☆12Nov 26, 2012Updated 13 years ago
Lucas-TY / llm_Implicit_reference
View on GitHub
Official Implementation of implicit reference attack
☆11Oct 16, 2024Updated last year
kannkyo / nvd-api
View on GitHub
NVD API 2.0 for python
☆12Sep 26, 2024Updated last year
holisticinfosec / EPSScall
View on GitHub
EPSScall
☆11Jun 10, 2022Updated 3 years ago
kondukto-io / semgrep-rules
View on GitHub
Custom semgrep rules registry
☆14Aug 23, 2022Updated 3 years ago
peterwestuw / GPT2ForwardBackward
View on GitHub
Code for running forward and backward versions of GPT2
☆10Nov 20, 2021Updated 4 years ago
KatherLab / prompt_injection_attacks
View on GitHub
☆13Dec 28, 2024Updated last year
jbaines-r7 / theway
View on GitHub
A tool for extracting, modifying, and crafting ASDM binary packages (CVE-2022-20829)
☆13Aug 15, 2022Updated 3 years ago
swissiety / JimpleLSP
View on GitHub
This is an implementation of the Language Server Protocol for Jimple. It enables your IDE to provide code exploring features while workin…
☆12Dec 15, 2023Updated 2 years ago
vulnerability-lookup / PyVulnerabilityLookup
View on GitHub
Python client and module for Vulnerability-Lookup.
☆14Dec 9, 2025Updated 3 months ago
aiPenguin / StopReasoning
View on GitHub
☆14Oct 6, 2024Updated last year
JuliaComputing / semgrep-rules-julia
View on GitHub
Julia rules for semgrep
☆14Dec 9, 2025Updated 3 months ago
dzungvpham / awesome-llm4privacy
View on GitHub
A curated collection of papers and related projects on using LLMs for privacy.
☆25Oct 8, 2025Updated 5 months ago
adobe / buildrunner
View on GitHub
Build and publish Docker images, run builds/tasks within Docker containers or on remote hosts.
☆16Updated this week
pkfrom / USBRelay
View on GitHub
USBRelay Development libraries external usage
☆11Mar 1, 2016Updated 10 years ago
jtesta / bitclamp
View on GitHub
Bitclamp allows arbitrary files to be permanently and anonymously published into the Bitcoin and Dogecoin blockchains.
☆14Jan 12, 2017Updated 9 years ago
ChrisCho-H / bithoven
View on GitHub
Bithoven is a smart contract language for composing powerful and secure instruments on Bitcoin. LR(1) parser with static analysis for com…
☆41Feb 25, 2026Updated last week
umd-huang-lab / VLM-Poisoning
View on GitHub
Code for Neurips 2024 paper "Shadowcast: Stealthy Data Poisoning Attacks Against Vision-Language Models"
☆59Jan 15, 2025Updated last year
ashwinraghav / Cqual
View on GitHub
Type qualifiers for C
☆16Sep 21, 2011Updated 14 years ago
cygonz0 / orion-webshell-detector
View on GitHub
This work-in-progress "Orion Webshell Detector" was created with the intention of assisting web application code reviews coded in PHP, AS…
☆13Oct 28, 2014Updated 11 years ago
ianramzy / ticker-iq
View on GitHub
📈 Stock screener and portfolio analyzer, providing key insights on financial reports, news articles and more!
☆13Jun 24, 2019Updated 6 years ago
OWASP / CodeReviewGuide
View on GitHub
Repository for OWASP Code Review document
☆17Jun 24, 2014Updated 11 years ago
Bowen1911 / xJailbreak
View on GitHub
Code of paper: xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking"
☆17Feb 17, 2026Updated 2 weeks ago
pulkit21 / wappalyzer
View on GitHub
Detect the server side language used for the website
☆13Mar 23, 2021Updated 4 years ago
actuated / check-smb-signing
View on GitHub
Shell script to automate running the Nmap smb-security-mode.nse or RunFinger.py by lgandx and parse results into counts and lists of host…
☆14Nov 3, 2017Updated 8 years ago
Bosco-Lam / BruteXss
View on GitHub
Based on shawarkhanethicalhacker/BruteXSS
☆15Jan 24, 2019Updated 7 years ago