eth-sri / ToolFuzzLinks
ToolFuzz is a fuzzing framework designed to test your LLM Agent tools.
☆20Updated 4 months ago
Alternatives and similar repositories for ToolFuzz
Users that are interested in ToolFuzz are comparing it to the libraries listed below
Sorting:
- A better way of testing, inspecting, and analyzing AI Agent traces.☆39Updated last week
- TaskTracker is an approach to detecting task drift in Large Language Models (LLMs) by analysing their internal activations. It provides a…☆60Updated 4 months ago
- Guardrails for secure and robust agent development☆316Updated last month
- Visualize any repo or codebase into diagram or animation☆18Updated 9 months ago
- Let Claude control a web browser on your machine.☆34Updated last month
- ☆96Updated 10 months ago
- RAGLight is a lightweight and modular Python library for implementing Retrieval-Augmented Generation (RAG), Agentic RAG and RAT (Retrieva…☆25Updated 3 months ago
- Test Generation for Prompts☆107Updated 2 weeks ago
- AI-powered computer control for automated testing. Factifai uses vision models (Claude, GPT-4o, Gemini) to interact with applications nat…☆41Updated 3 weeks ago
- a Python library that uses Reinforcement Learning (RL) to train LLMs.☆33Updated last month
- QLLM: A powerful CLI for seamless interaction with multiple Large Language Models. Simplify AI workflows, streamline development, and unl…☆33Updated 3 months ago
- CursorCore: Assist Programming through Aligning Anything☆127Updated 5 months ago
- A task management system designed for AI development☆49Updated last month
- Building Agents with LLM structured generation (BAML), MCP Tools, and 12-Factor Agents principles☆26Updated 2 weeks ago
- ☆51Updated 3 weeks ago
- ☆51Updated last month
- Contains the prompts we use to talk to various LLMs for different utilities inside the editor☆79Updated last year
- LLM-based mutation testing☆11Updated 5 months ago
- 😎 Awesome list of resources about using and building AI software development systems☆111Updated last year
- Specification for creating reliable LLM-based conversational agents☆50Updated last week
- A framework for hosting and scaling AI agents.☆36Updated 7 months ago
- Unofficial Claude Code SDKs for Typescript and Python☆15Updated 2 months ago
- Probably one of the lightest native RAG + Agent apps out there,experience the power of Agent-powered models and Agent-driven knowledge ba…☆28Updated last month
- Enhancing AI Software Engineering with Repository-level Code Graph☆191Updated 3 months ago
- A simple ReAct agent that has access to LlamaIndex docs and to the internet to provide you with insights on LlamaIndex itself.☆11Updated 4 months ago
- Harness used to benchmark aider against SWE Bench benchmarks☆72Updated last year
- AgentFence is an open-source platform for automatically testing AI agent security. It identifies vulnerabilities such as prompt injection…☆15Updated 4 months ago
- TDD-Bench-Verified is a new benchmark for generating test cases for test-driven development (TDD)☆21Updated 2 weeks ago
- Example implementation of Iteration of Tought - Gives a star if you like the project☆42Updated 6 months ago
- Deep Research through Multi-Agents, using GraphRAG☆76Updated 8 months ago