jiaxiaojunQAQ / I-GCGView external linksLinks
Improved techniques for optimization-based jailbreaking on large language models (ICLR2025)
☆142Apr 7, 2025Updated 10 months ago
Alternatives and similar repositories for I-GCG
Users that are interested in I-GCG are comparing it to the libraries listed below
Sorting:
- Improving fast adversarial training with prior-guided knowledge (TPAMI2024)☆43Apr 21, 2024Updated last year
- Code for Semantic-Aligned Adversarial Evolution Triangle for High-Transferability Vision-Language Attack(TPAMI 2025)☆43Aug 28, 2025Updated 5 months ago
- AmpleGCG: Learning a Universal and Transferable Generator of Adversarial Attacks on Both Open and Closed LLM☆84Nov 3, 2024Updated last year
- Revisiting and Exploring Efficient Fast Adversarial Training via LAW: Lipschitz Regularization and Auto Weight Averaging (TIFS2024)☆37Jun 4, 2024Updated last year
- ☆23Jan 17, 2025Updated last year
- Improved Few-Shot Jailbreaking Can Circumvent Aligned Language Models and Their Defenses (NeurIPS 2024)☆65Jan 11, 2025Updated last year
- [NeurIPS 2024] Accelerating Greedy Coordinate Gradient and General Prompt Optimization via Probe Sampling☆33Nov 8, 2024Updated last year
- Code repo of our paper Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis (https://arxiv.org/abs/2406.10794…☆23Jul 26, 2024Updated last year
- A fast + lightweight implementation of the GCG algorithm in PyTorch☆317May 13, 2025Updated 9 months ago
- [NDSS'24] Inaudible Adversarial Perturbation: Manipulating the Recognition of User Speech in Real Time☆56Sep 28, 2024Updated last year
- ☆14Feb 26, 2025Updated 11 months ago
- Code for Fast Propagation is Better: Accelerating Single-Step Adversarial Training via Sampling Subnetworks (TIFS2024)☆13Mar 29, 2024Updated last year
- TACL 2025: Investigating Adversarial Trigger Transfer in Large Language Models☆19Aug 17, 2025Updated 5 months ago
- ☆57Jun 5, 2024Updated last year
- REINFORCE Adversarial Attacks on Large Language Models: An Adaptive, Distributional, and Semantic Objective☆21Feb 28, 2025Updated 11 months ago
- The official repository for guided jailbreak benchmark☆28Jul 28, 2025Updated 6 months ago
- [ICML 2025] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping".☆163May 2, 2025Updated 9 months ago
- A Unified Benchmark and Toolbox for Multimodal Jailbreak Attack–Defense Evaluation☆57Jan 23, 2026Updated 3 weeks ago
- nsq_go适用于集群版nsq的场景 ,内部封装了nsq的生产者和消费者,生产者根据nsqlookupd发现连接nsqd🚀☆10Oct 10, 2022Updated 3 years ago
- Emoji Attack [ICML 2025]☆41Jul 15, 2025Updated 7 months ago
- Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, data…☆1,205Feb 6, 2026Updated last week
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- ☆109Feb 16, 2024Updated 2 years ago
- [ICML 2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast☆118Mar 26, 2024Updated last year
- 助你快速开发网页!让世界上没有难做的网页!☆110Dec 5, 2025Updated 2 months ago
- ☆40Sep 23, 2024Updated last year
- ☆29Mar 28, 2023Updated 2 years ago
- [IROS 2024] SCANet: Correcting LEGO Assembly Errors with Self-Correct Assembly Network (FINALIST BEST APPLICATION PAPER)☆23Oct 26, 2024Updated last year
- This is repository to introduce Vision-based cart for obstacle avoidance navigation, using Airsim and Tensorflow.☆13Jun 29, 2023Updated 2 years ago
- [ICLR 2025] Cheating Automatic LLM Benchmarks: Null Models Achieve High Win Rates (Oral)☆84Oct 23, 2024Updated last year
- Ensure that only legitimate users can access the API☆19Jul 10, 2024Updated last year
- Code for safety test in "Keeping LLMs Aligned After Fine-tuning: The Crucial Role of Prompt Templates"☆22Sep 21, 2025Updated 4 months ago
- Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]☆377Jan 23, 2025Updated last year
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…☆427Jan 22, 2025Updated last year
- [AAAI 2021] VMLoc: Variational Fusion For Learning-Based Multimodal Camera Localization☆31Oct 27, 2024Updated last year
- Official repository for ICML 2024 paper "On Prompt-Driven Safeguarding for Large Language Models"☆106May 20, 2025Updated 8 months ago
- Code repository for the paper "Heuristic Induced Multimodal Risk Distribution Jailbreak Attack for Multimodal Large Language Models"☆15Aug 7, 2025Updated 6 months ago
- 跃入Spring的汪洋大海,探寻Spring Boot与各种框架的完美融合之道,从基础到高级,涵盖Spring Boot、Spring Boot & Shiro、Spring Security Oauth2、Spring Cloud等等,简洁而易懂的示例,带你领略Sprin…☆29Sep 1, 2023Updated 2 years ago
- ☆32Feb 21, 2023Updated 2 years ago