eth-sri / ToolFuzz
ToolFuzz is a fuzzing framework designed to test your LLM Agent tools.
☆14Updated 2 weeks ago
Alternatives and similar repositories for ToolFuzz:
Users that are interested in ToolFuzz are comparing it to the libraries listed below
- A better way of testing, inspecting, and analyzing AI Agent traces.☆30Updated this week
- A framework-less approach to robust agent development.☆156Updated this week
- Accompanying code and SEP dataset for the "Can LLMs Separate Instructions From Data? And What Do We Even Mean By That?" paper.☆50Updated 3 weeks ago
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…☆50Updated 3 weeks ago
- [ICLR'24 Spotlight] A language model (LM)-based emulation framework for identifying the risks of LM agents with tool use☆133Updated last year
- Does Refusal Training in LLMs Generalize to the Past Tense? [ICLR 2025]☆66Updated 2 months ago
- ☆25Updated 3 weeks ago
- Repo for the research paper "SecAlign: Defending Against Prompt Injection with Preference Optimization"☆41Updated 2 months ago
- ☆50Updated 4 months ago
- ☆13Updated last year
- Enhancing AI Software Engineering with Repository-level Code Graph☆149Updated 2 months ago
- ☆37Updated 2 weeks ago
- Official code repository for Sketch-of-Thought (SoT)☆94Updated this week
- Challenges for general-purpose web-browsing AI agents☆45Updated last month
- Source code for paper: INTERVENOR : Prompt the Coding Ability of Large Language Models with the Interactive Chain of Repairing☆26Updated 4 months ago
- Red-Teaming Language Models with DSPy☆175Updated last month
- Code interpreter support for o1☆32Updated 6 months ago
- AISafetyLab: A comprehensive framework covering safety attack, defense, evaluation and paper list.☆104Updated this week
- Moatless Testbeds allows you to create isolated testbed environments in a Kubernetes cluster where you can apply code changes through git…☆10Updated last month
- Contains the model patches and the eval logs from the passing swe-bench-lite run.☆10Updated 9 months ago
- ☆56Updated 6 months ago
- [NDSS'25 Poster] A collection of automated evaluators for assessing jailbreak attempts.☆133Updated 3 weeks ago
- Test LLMs against jailbreaks and unprecedented harms☆26Updated 5 months ago
- Code for ScribeAgent paper☆54Updated 3 weeks ago
- ☆24Updated 2 months ago
- Weak-to-Strong Jailbreaking on Large Language Models☆72Updated last year
- A data construction and evaluation framework to quantify privacy norm awareness of language models (LMs) and emerging privacy risk of LM …☆25Updated 3 weeks ago
- Implementation of the paper: "AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks?"☆52Updated 3 months ago
- The official repo for the code and data of paper SMART☆22Updated last month
- AI conflict resolution framework designed to work alongside existing AI orchestration tools☆23Updated 3 months ago