Research on "Many-Shot Jailbreaking" in Large Language Models (LLMs). It unveils a novel technique capable of bypassing the safety mechanisms of LLMs, including those developed by Anthropic and other leading AI organizations.
☆16Aug 6, 2024Updated last year
Alternatives and similar repositories for Many-Shot-Jailbreaking-Demo
Users who are interested in Many-Shot-Jailbreaking-Demo are comparing it to the libraries listed below.
- ASCII Smuggling Hidden Prompt Injection is a novel approach to hacking AI assistants using Unicode Tags. This project demonstrates how to u…☆19Aug 7, 2024Updated last year
- Official implementation of Visco-Attack (EMNLP 2025 Main). An open-source one-click reproduction script is also provided.☆30Apr 11, 2026Updated 2 months ago
- ICCV 2023 - AdaptGuard: Defending Against Universal Attacks for Model Adaptation☆11Dec 23, 2023Updated 2 years ago
- ☆13Sep 24, 2023Updated 2 years ago
- This repository is the replication package of the NeurIPS19 paper "MarginGAN: Adversarial Training in Semi-Supervised Learning"☆12Oct 27, 2019Updated 6 years ago
- ☆13Mar 16, 2025Updated last year
- Github repo for One-shot Neural Backdoor Erasing via Adversarial Weight Masking (NeurIPS 2022)☆15Jan 3, 2023Updated 3 years ago
- Automated Wifi Hacking Tool☆13Mar 16, 2021Updated 5 years ago
- [ICML 2025] Official repository for paper "OR-Bench: An Over-Refusal Benchmark for Large Language Models"☆28Mar 4, 2025Updated last year
- ☆18Oct 7, 2022Updated 3 years ago
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- This MCP server acts as a bridge between the official Hacker News API and AI-powered tools that support the Model Context Protocol, such …☆32Jun 9, 2025Updated last year
- Code of paper [CVPR'24: Can Protective Perturbation Safeguard Personal Data from Being Exploited by Stable Diffusion?]☆26Apr 2, 2024Updated 2 years ago
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated last year
- ☆13Jul 13, 2024Updated last year
- Robust Point Cloud Processing through Positional Embedding☆13Sep 7, 2023Updated 2 years ago
- ☆30May 22, 2024Updated 2 years ago
- ☆18Mar 30, 2025Updated last year
- [Under Development] This is a modification to the Ubertooth One firmware developed by Great Scott Gadgets. The modified firmware is able …☆23Feb 22, 2022Updated 4 years ago
- ☆18Sep 7, 2024Updated last year
- Pytorch implementation and comparison of Fourier Feature Networks and Sinusoidal Representation Networks☆13Jun 27, 2020Updated 5 years ago
- Implementation codes for KDD24 paper "LLM4DyG: Can Large Language Models Solve Spatial-Temporal Problems on Dynamic Graphs?"☆34Sep 10, 2024Updated last year
- [CVPRW 2023] Diversity is Definitely Needed: Improving Model-Agnostic Zero-shot Classification via Stable Diffusion☆24Jan 24, 2024Updated 2 years ago
- ☆25Apr 26, 2023Updated 3 years ago
- A lifecycle guard skill.☆181Mar 27, 2026Updated 2 months ago
- Unlock unlimited Cursor AI trials on any platform. Scripts and helper tools to reset Cursor AI’s config, spin up fresh 150-trial sessions…☆49May 22, 2025Updated last year
- The multi-view version of MonoDETR on nuScenes dataset☆21Nov 4, 2022Updated 3 years ago
- AIBO Battery SMBus Reader☆18Jan 1, 2018Updated 8 years ago
- ☆24Feb 17, 2026Updated 4 months ago
- Code for Transferable Unlearnable Examples☆22Mar 11, 2023Updated 3 years ago
- CVPR2023: Unlearnable Clusters: Towards Label-agnostic Unlearnable Examples☆22Apr 25, 2023Updated 3 years ago
- ☆29Jun 17, 2024Updated 2 years ago
- [CVPR 2023] The official implementation of our CVPR 2023 paper "Detecting Backdoors During the Inference Stage Based on Corruption Robust…☆25May 25, 2023Updated 3 years ago
- ☆12Jul 18, 2022Updated 3 years ago
- This repository contains a PyTorch implementation of Latent Diffusion Transformer for Point Cloud Generation☆15Nov 7, 2023Updated 2 years ago
- [ACL 2024] ValueBench: Towards Comprehensively Evaluating Value Orientations and Understanding of Large Language Models☆27Jan 11, 2025Updated last year
- HackTheWeb is a production-ready, AI-powered web application penetration testing tool designed for security professionals and ethical hac…☆35Oct 5, 2025Updated 8 months ago
- The most comprehensive and accurate LLM jailbreak attack benchmark by far☆21Mar 22, 2025Updated last year
- ☆23Feb 3, 2026Updated 4 months ago