The open-source repository of FuzzLLM
☆36May 4, 2024Updated last year
Alternatives and similar repositories for FuzzLLM
Users that are interested in FuzzLLM are comparing it to the libraries listed below.
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment☆22Mar 20, 2025Updated last year
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M…☆436Jan 22, 2025Updated last year
- ☆22Aug 6, 2023Updated 2 years ago
- csl: PyTorch-based Constrained Learning☆11Jun 1, 2022Updated 3 years ago
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025)☆20Jan 18, 2026Updated 2 months ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning☆45Mar 18, 2026Updated 3 weeks ago
- [Tensorflow] A Game Theoretic approach using GAN for Phishing URL synthesis and detection☆11Nov 14, 2022Updated 3 years ago
- A Python tool to visualize the global distribution of your academic citations.☆24Nov 24, 2025Updated 4 months ago
- [USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai…☆58Mar 22, 2025Updated last year
- aigc evals☆10Dec 2, 2023Updated 2 years ago
- ☆17Oct 29, 2021Updated 4 years ago
- Extensible Platform for Malware Analysis☆17Jan 14, 2021Updated 5 years ago
- Navigation Turing Test (NTT): Learning to Evaluate Human-Like Navigation [ICML 2021]☆15Jul 17, 2025Updated 8 months ago
- ☆11Apr 3, 2024Updated 2 years ago
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"!☆11Oct 16, 2024Updated last year
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability.☆18Jan 14, 2025Updated last year
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models.☆14Dec 16, 2024Updated last year
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models☆16Jun 18, 2025Updated 9 months ago
- ☆19Feb 11, 2022Updated 4 years ago
- 🔐 A list of anonymity papers published from 2012 to 2025.☆17Nov 26, 2025Updated 4 months ago
- ☆12Sep 23, 2024Updated last year
- ☆14Sep 28, 2023Updated 2 years ago
- Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring official code☆11Jul 17, 2023Updated 2 years ago
- Designed by ChangWenhan (China University of Geosciences)☆13Mar 28, 2021Updated 5 years ago
- Jupyter Kernel for CodeQL☆15Feb 26, 2025Updated last year
- Official repository for the paper "Gradient-based Jailbreak Images for Multimodal Fusion Models" (https://arxiv.org/abs/2410.03489)☆19Oct 22, 2024Updated last year
- Official implementation for UniASM: Binary Code Similarity Detection without Fine-tuning.☆20Apr 6, 2023Updated 3 years ago
- Foolbox implementation for NeurIPS 2021 Paper: "Fast Minimum-norm Adversarial Attacks through Adaptive Norm Constraints".☆24Mar 16, 2022Updated 4 years ago
- A Python module that monkey patches pexpect mainly for binary transfers.☆18Feb 28, 2019Updated 7 years ago
- [CVPR2026 Highlight] Cubic Discrete Diffusion: Discrete Visual Generation on High-Dimensional Representation Tokens https://arxiv.org/abs…☆53Updated this week
- This is a repository for code, data, and models associated with the paper LLM-RUBRIC: A Multidimensional, Calibrated Approach to Automate…☆29Mar 30, 2026Updated 2 weeks ago
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types☆25Nov 29, 2024Updated last year
- Experiments with representation engineering☆14Feb 28, 2024Updated 2 years ago
- Code for the ICCV 2025 paper "IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves"☆17Jul 11, 2025Updated 9 months ago
- Qt/C++ calculator☆14Feb 21, 2020Updated 6 years ago
- An easy-to-use Python framework to generate adversarial jailbreak prompts.☆834Mar 30, 2026Updated 2 weeks ago
- Fine-tunes BERT, training on top of the chinese_L-12_H-768_A-12 model and testing with MSRA as the dataset☆18Dec 12, 2019Updated 6 years ago
- Official Repository of Personalized Visual Instruct Tuning☆34Mar 6, 2025Updated last year
- Sparse and discrete interpretability tool for neural networks☆64Feb 12, 2024Updated 2 years ago