Research on "Many-Shot Jailbreaking" in Large Language Models (LLMs). It unveils a novel technique capable of bypassing the safety mechanisms of LLMs, including those developed by Anthropic and other leading AI organizations. Resources
☆16Aug 6, 2024Updated last year
Alternatives and similar repositories for Many-Shot-Jailbreaking-Demo
Users that are interested in Many-Shot-Jailbreaking-Demo are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Diagnostic Framework for LLMs and MLLMs☆36Mar 2, 2026Updated last month
- ICCV 2023 - AdaptGuard: Defending Against Universal Attacks for Model Adaptation☆11Dec 23, 2023Updated 2 years ago
- This repository is the replication package of the NeurIPS19 paper "MarginGAN: Adversarial Training in Semi-Supervised Learning"☆12Oct 27, 2019Updated 6 years ago
- A (M5)Launcher port for the Tab5 device☆19Dec 14, 2025Updated 4 months ago
- Github repo for One-shot Neural Backdoor Erasing via Adversarial Weight Masking (NeurIPS 2022)☆15Jan 3, 2023Updated 3 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆18Oct 7, 2022Updated 3 years ago
- Python tool for large scale git analysis. Inspired by gitrob.☆21Jun 12, 2020Updated 5 years ago
- Plugins for the pwnagotchi☆14Dec 29, 2024Updated last year
- A repo for LLM jailbreak☆14Sep 5, 2023Updated 2 years ago
- 🛡️ Detect and respond to security threats in real-time with God-Eye, an AI-driven tool designed for privacy and local deployment on mult…☆39Updated this week
- [Under Development] This is a modification to the Ubertooth One firmware developed by Great Scott Gadgets. The modified firmware is able …☆22Feb 22, 2022Updated 4 years ago
- This MCP server acts as a bridge between the official Hacker News API and AI-powered tools that support the Model Context Protocol, such …☆32Jun 9, 2025Updated 10 months ago
- Code of paper [CVPR'24: Can Protective Perturbation Safeguard Personal Data from Being Exploited by Stable Diffusion?]☆25Apr 2, 2024Updated 2 years ago
- The repo for using the model https://huggingface.co/thu-coai/Attacker-v0.1☆13Apr 23, 2025Updated 11 months ago
- 1-Click AI Models by DigitalOcean Gradient • AdDeploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
- Accepted by ECCV 2024☆201Oct 15, 2024Updated last year
- ☆30May 22, 2024Updated last year
- ☆18Mar 30, 2025Updated last year
- ☆17Sep 7, 2024Updated last year
- A lifecycle guard skill.☆140Mar 27, 2026Updated 3 weeks ago
- Unlock unlimited Cursor AI trials on any platform. Scripts and helper tools to reset Cursor AI’s config, spin up fresh 150-trial sessions…☆34May 22, 2025Updated 10 months ago
- Pytorch implementation and comparison of Fourier Feature Networks and Sinusoidal Representation Networks☆13Jun 27, 2020Updated 5 years ago
- [CVPR 2024] official code for SimAC☆21Jan 23, 2025Updated last year
- Code for ICCV 2023 work "Generalized Few-Shot Point Cloud Segmentation Via Geometric Words"☆14Sep 26, 2023Updated 2 years ago
- Managed Database hosting by DigitalOcean • AdPostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
- Implementation codes for KDD24 paper "LLM4DyG: Can Large Language Models Solve Spatial-Temporal Problems on Dynamic Graphs?"☆33Sep 10, 2024Updated last year
- ☆25Apr 26, 2023Updated 2 years ago
- [CVPR'24 Oral] Metacloak: Preventing Unauthorized Subject-driven Text-to-image Diffusion-based Synthesis via Meta-learning☆30Nov 19, 2024Updated last year
- AIBO Battery SMBus Reader☆17Jan 1, 2018Updated 8 years ago
- ☆24Feb 17, 2026Updated 2 months ago
- ☆15May 23, 2024Updated last year
- CVPR2023: Unlearnable Clusters: Towards Label-agnostic Unlearnable Examples☆22Apr 25, 2023Updated 2 years ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆25Nov 29, 2024Updated last year
- [CVPR 2023] The official implementation of our CVPR 2023 paper "Detecting Backdoors During the Inference Stage Based on Corruption Robust…☆25May 25, 2023Updated 2 years ago
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- ☆126Mar 26, 2026Updated 3 weeks ago
- [ICLR 2023, Spotlight] Indiscriminate Poisoning Attacks on Unsupervised Contrastive Learning☆31Dec 2, 2023Updated 2 years ago
- Code implementation for paper "Can Large Language Models Empower Molecular Property Prediction?"☆39Jul 14, 2023Updated 2 years ago
- The most comprehensive and accurate LLM jailbreak attack benchmark by far☆21Mar 22, 2025Updated last year
- ☆23Feb 3, 2026Updated 2 months ago
- ☆13Oct 5, 2025Updated 6 months ago
- Open-source implementation of Google's TurboQuant (ICLR 2026) — KV cache compression to 2.5–4 bits with near-zero quality loss. 3.8–5.7x …☆48Mar 29, 2026Updated 3 weeks ago