RainJamesY/FuzzLLM

Readme badge preview -

If you own this repo, copy the snippet below and add it to your README.md

[![RelatedRepos](https://img.shields.io/badge/related-repos-yellow)](https://relatedrepos.com/gh/RainJamesY/FuzzLLM)

RainJamesY / FuzzLLM

The opensoure repository of FuzzLLM

☆37

Alternatives and similar repositories for FuzzLLM

Users that are interested in FuzzLLM are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

vlm2-bench / VLM2-Bench
View on GitHub
VLM2-Bench [ACL 2025 Main]: A Closer Look at How Well VLMs Implicitly Link Explicit Matching Visual Cues
☆45May 20, 2025Updated last year
AI45Lab / X-Boundary
View on GitHub
[EMNLP 2025] The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Com…
☆41Nov 24, 2025Updated 7 months ago
LLMSmith / LLMSmith
View on GitHub
☆49Feb 26, 2025Updated last year
Jihuai-wpy / InferAligner
View on GitHub
Inference-time alignment for harmlessness through cross-model guidance (ACL 2024). Code + MM-Harmful Bench.
☆38Oct 2, 2024Updated last year
SheltonLiu-N / AutoDAN
View on GitHub
[ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…
☆448Jan 22, 2025Updated last year
Deploy to Railway using AI coding agents - Free Credits Offer • Ad
Use Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
GarlicFuzzer / PeachFuzzer_Pits
View on GitHub
☆22Aug 6, 2023Updated 2 years ago
lfochamon / csl
View on GitHub
csl: PyTorch-based Constrained Learning
☆11Jun 1, 2022Updated 4 years ago
peterljq / Concept-Lancet
View on GitHub
The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)
☆20Jan 18, 2026Updated 5 months ago
datasec-lab / CodeBreaker
View on GitHub
[USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai…
☆59Mar 22, 2025Updated last year
mll-lab-nu / ENACT
View on GitHub
ENACT is a benchmark that evaluates embodied cognition through world modeling from egocentric interaction. It is designed to be simple an…
☆52Nov 27, 2025Updated 7 months ago
rabbitjy / FuzzTuning
View on GitHub
☆24Sep 5, 2023Updated 2 years ago
CostaliyA / Flow-OPD
View on GitHub
Official Repo of "Flow-OPD: On-Policy Distillation for Flow Matching Models"
☆257Jun 24, 2026Updated 2 weeks ago
zhu-minjun / SafetyLock
View on GitHub
Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!
☆11Oct 16, 2024Updated last year
RUIYUN-ML / ERM-KTP
View on GitHub
☆11Apr 3, 2024Updated 2 years ago
Wordpress hosting with auto-scaling - Free Trial Offer • Ad
Fully Managed hosting for WordPress and WooCommerce businesses that need reliable, auto-scalable performance. Cloudways SafeUpdates now available.
RKorzeniowski / BigBiGAN-PyTorch
View on GitHub
Unofficail pytorch implementation of BigBiGAN
☆11Mar 26, 2021Updated 5 years ago
DYR1 / MoGU
View on GitHub
Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.
☆18Jan 14, 2025Updated last year
AI-Hypercomputer / inference-benchmark
View on GitHub
☆21Mar 11, 2026Updated 3 months ago
jiah-li / magic
View on GitHub
The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.
☆15Dec 16, 2024Updated last year
thu-coai / LongSafety
View on GitHub
[ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models
☆16Jun 18, 2025Updated last year
RUCAIBox / RLMEC
View on GitHub
The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"
☆39Jan 12, 2024Updated 2 years ago
ixxchan / nb
View on GitHub
naïve blockchain in Rust
☆10Nov 13, 2020Updated 5 years ago
chuhac / Reasoning-to-Defend
View on GitHub
[EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking
☆12Aug 22, 2025Updated 10 months ago
georgetown-cset / CSET-AIID-harm-taxonomy
View on GitHub
Repo containing documentation and explanation for CSET's harm taxonomy of incidents from AIID.
☆21Jun 21, 2024Updated 2 years ago
1-Click AI Models by DigitalOcean Gradient • Ad
Deploy popular AI models on DigitalOcean Gradient GPU virtual machines with just a single click. Zero configuration with optimized deployments.
weizeming / momentum-attack-llm
View on GitHub
☆25Jan 17, 2025Updated last year
NUSTM / LLMs-Waver-In-Judgments
View on GitHub
☆12Sep 23, 2024Updated last year
eunomia-bpf / bpf-benchmark
View on GitHub
AI Agent eBPF optimization benchmark and framework
☆20Jul 2, 2026Updated last week
exped1230 / S2-VER
View on GitHub
The official implement of paper S2-VER: Semi-Supervised Visual Emotion Recognition
☆11Apr 28, 2024Updated 2 years ago
BD-MF / ASCM4ABSA
View on GitHub
**ASCM4ABSA** - Our code and proposed data for NLPCC 2022 paper titled "Aspect-specific Context Modeling for Aspect-based Sentiment Analy…
☆12Mar 26, 2023Updated 3 years ago
fenrus75 / csum_partial
View on GitHub
Notes on optimizing the linux kernel function csum_partial
☆14Nov 28, 2021Updated 4 years ago
maroo-sky / FSD
View on GitHub
Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring official code
☆11Jul 17, 2023Updated 2 years ago
ChangWenhan / StudentManagementSystem-Qt
View on GitHub
Designed by ChangWenhan (China University of Geosciences)
☆13Mar 28, 2021Updated 5 years ago
GitHubSecurityLab / codeql-jupyter-kernel
View on GitHub
Jupyter Kernel for CodeQL
☆15Feb 26, 2025Updated last year
Managed Database hosting by DigitalOcean • Ad
PostgreSQL, MySQL, MongoDB, Kafka, Valkey, and OpenSearch available. Automatically scale up storage and focus on building your apps.
facebookresearch / multimodal-fusion-jailbreaks
View on GitHub
Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https//arxiv.org/abs/2410.03489)
☆20Oct 22, 2024Updated last year
MurrayTom / SG-Bench
View on GitHub
SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types
☆26Nov 29, 2024Updated last year
mishajw / repeng
View on GitHub
Experiments with representation engineering
☆14Feb 28, 2024Updated 2 years ago
YuqingWang1029 / CubiD
View on GitHub
[CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs…
☆62Apr 10, 2026Updated 2 months ago
roywang021 / IDEATOR
View on GitHub
Code for ICCV2025 paper——IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves
☆18Jul 11, 2025Updated 11 months ago
JailbreakBench / artifacts
View on GitHub
Jailbreak artifacts for JailbreakBench
☆97Nov 6, 2024Updated last year
UCLA-StarAI / GeLaTo
View on GitHub
☆24Jun 12, 2023Updated 3 years ago