The open-source repository of FuzzLLM
☆36 · May 4, 2024 · Updated last year
Alternatives and similar repositories for FuzzLLM
Users who are interested in FuzzLLM are comparing it to the libraries listed below.
- Multi-dimensional analysis of orthogonal safety directions in LLM alignment ☆21 · Mar 20, 2025 · Updated last year
- ☆44 · Feb 26, 2025 · Updated last year
- [EMNLP 2025] The code repo of paper "X-Boundary: Establishing Exact Safety Boundary to Shield LLMs from Multi-Turn Jailbreaks without Com… ☆40 · Nov 24, 2025 · Updated 4 months ago
- ☆38 · Oct 2, 2024 · Updated last year
- [ICLR 2024] The official implementation of our ICLR2024 paper "AutoDAN: Generating Stealthy Jailbreak Prompts on Aligned Large Language M… ☆434 · Jan 22, 2025 · Updated last year
- csl: PyTorch-based Constrained Learning ☆11 · Jun 1, 2022 · Updated 3 years ago
- The dataset CoLan-150K and the concept decomposition in the paper Concept Lancet (CVPR 2025) ☆20 · Jan 18, 2026 · Updated 2 months ago
- Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning ☆44 · Mar 18, 2026 · Updated last week
- [USENIX Security '24] An LLM-Assisted Easy-to-Trigger Backdoor Attack on Code Completion Models: Injecting Disguised Vulnerabilities agai… ☆57 · Mar 22, 2025 · Updated last year
- aigc evals ☆10 · Dec 2, 2023 · Updated 2 years ago
- ☆24 · Sep 5, 2023 · Updated 2 years ago
- ☆17 · Oct 29, 2021 · Updated 4 years ago
- RAG Hallucination Detecting By LRP. ☆11 · Mar 31, 2025 · Updated 11 months ago
- ☆24 · Jan 27, 2026 · Updated last month
- Your finetuned model's back to its original safety standards faster than you can say "SafetyLock"! ☆11 · Oct 16, 2024 · Updated last year
- ☆11 · Apr 3, 2024 · Updated last year
- Unofficial PyTorch implementation of BigBiGAN ☆11 · Mar 26, 2021 · Updated 5 years ago
- Our research proposes a novel MoGU framework that improves LLMs' safety while preserving their usability. ☆18 · Jan 14, 2025 · Updated last year
- ☆18 · Mar 11, 2026 · Updated 2 weeks ago
- The repo for paper: Exploiting the Index Gradients for Optimization-Based Jailbreaking on Large Language Models. ☆14 · Dec 16, 2024 · Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint" ☆39 · Jan 12, 2024 · Updated 2 years ago
- [ACL 2025] LongSafety: Evaluating Long-Context Safety of Large Language Models ☆16 · Jun 18, 2025 · Updated 9 months ago
- ☆19 · Feb 11, 2022 · Updated 4 years ago
- [EMNLP 2025] Reasoning-to-Defend: Safety-Aware Reasoning Can Defend Large Language Models from Jailbreaking ☆12 · Aug 22, 2025 · Updated 7 months ago
- Repo containing documentation and explanation for CSET's harm taxonomy of incidents from AIID. ☆19 · Jun 21, 2024 · Updated last year
- ☆12 · Sep 23, 2024 · Updated last year
- Userspace eBPF Runtime Benchmarking Test Suite and Results ☆16 · Updated this week
- The official implementation of the paper S2-VER: Semi-Supervised Visual Emotion Recognition ☆11 · Apr 28, 2024 · Updated last year
- **ASCM4ABSA** - Our code and proposed data for NLPCC 2022 paper titled "Aspect-specific Context Modeling for Aspect-based Sentiment Analy… ☆12 · Mar 26, 2023 · Updated 3 years ago
- Notes on optimizing the Linux kernel function csum_partial ☆14 · Nov 28, 2021 · Updated 4 years ago
- Feature Structure Distillation with Centered Kernel Alignment in BERT Transferring official code ☆11 · Jul 17, 2023 · Updated 2 years ago
- Designed by ChangWenhan (China University of Geosciences) ☆13 · Mar 28, 2021 · Updated 4 years ago
- Repository of the paper 'CodeQueries: A Dataset of Semantic Queries over Code' published in ISEC 2024 ☆13 · Apr 21, 2024 · Updated last year
- Foolbox implementation for NeurIPS 2021 Paper: "Fast Minimum-norm Adversarial Attacks through Adaptive Norm Constraints". ☆24 · Mar 16, 2022 · Updated 4 years ago
- This is a repository for code, data, and models associated with the paper LLM-RUBRIC: A Multidimensional, Calibrated Approach to Automate… ☆26 · Feb 18, 2025 · Updated last year
- SG-Bench: Evaluating LLM Safety Generalization Across Diverse Tasks and Prompt Types ☆24 · Nov 29, 2024 · Updated last year
- Experiments with representation engineering ☆14 · Feb 28, 2024 · Updated 2 years ago
- Code for the ICCV 2025 paper "IDEATOR: Jailbreaking and Benchmarking Large Vision-Language Models Using Themselves" ☆17 · Jul 11, 2025 · Updated 8 months ago
- ☆23 · Jun 12, 2023 · Updated 2 years ago