zeroeval / zeroeval-sdk
☆14 · Updated 3 weeks ago
Alternatives and similar repositories for zeroeval-sdk
Users who are interested in zeroeval-sdk are comparing it to the libraries listed below.
- An MCP server that autonomously evaluates web applications. ☆1,202 · Updated last week
- Add natural language control to your React app, with MCP and generative UX ☆788 · Updated last week
- A Python library for LLM-based evaluation using weighted rubrics. ☆39 · Updated this week
- 🔥 Reliable Browser AI Agents (YC S25) ☆1,659 · Updated this week
- Visually inspect MCP servers ☆1,307 · Updated last week
- Run Claude Code, Gemini, Codex, or any coding agent in a clean, isolated sandbox with sensitive data redaction and observability baked… ☆1,492 · Updated 3 weeks ago
- Modelence is an all-in-one TypeScript platform. We're building a Supabase alternative for MongoDB developers shipping production apps, wi… ☆312 · Updated last week
- Cursor extension that forwards frontend errors and screenshots to composer in one-click, making development seamless for you. Download he… ☆276 · Updated 6 months ago
- Using LLMs to transpile from Coq to Lean (public version, may be out of date) ☆19 · Updated last month
- Emdash is an orchestration layer for running multiple coding agents in parallel in isolated Git worktrees ☆619 · Updated last week
- Production-Ready MCP Server Framework • Build, deploy & scale secure AI agent infrastructure • Includes Auth, Observability, Debugger, Te… ☆789 · Updated this week
- Postman for MCP servers ☆122 · Updated 3 months ago
- The Intelligence Layer for AI agents. Connect your models, tools, and data to create agentic apps that can think, act and talk to you. ☆452 · Updated this week
- The fastest, lightest, and easiest-to-integrate AI gateway on the market. Fully open-sourced. ☆452 · Updated 3 months ago
- Omnara (YC S25) - Talk to Your AI Agents from Anywhere! ☆2,476 · Updated last week
- Visual editor for Cursor. Send UI comments and screenshots directly to Cursor as prompts. ☆380 · Updated 2 weeks ago
- Lilac is an open-source tool that ensures your data scientists always have enough gpus for their work. We seamlessly connect compute from… ☆115 · Updated 2 months ago
- Run multiple Codex and Claude Code AI sessions in parallel git worktrees. Test, compare approaches & manage AI-assisted development workf… ☆2,392 · Updated this week
- claude code system prompt ☆667 · Updated 3 months ago
- AI Browser Automation ☆786 · Updated last week
- Build your personal memory system to power your AI apps. ☆915 · Updated this week
- Open source implementation of Poke ☆366 · Updated last month
- ☆20 · Updated this week
- Query MCP enables end-to-end management of Supabase via chat interface: read & write query executions, management API support, automatic … ☆805 · Updated last month
- A minimalistic MCP client with a good feature set. ☆812 · Updated 3 months ago
- Autumn is an open-source pricing & billing platform ☆2,088 · Updated this week
- A Python framework that emulates Grok Heavy functionality using intelligent multi-agent orchestration. Deploy 4 (or more) specialized AI … ☆1,026 · Updated 3 months ago
- Exa MCP for web search and web crawling! ☆3,196 · Updated this week
- Open chat interface for all your models. ☆1,258 · Updated last month
- ☆567 · Updated last month