AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
☆950Jun 24, 2026Updated last week
Alternatives and similar repositories for autoevals
Users that are interested in autoevals are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- Evaluate your LLM-powered apps with TypeScript☆1,608Apr 28, 2026Updated 2 months ago
- ☆58Updated this week
- JavaScript Tracing & Evals library for Braintrust☆23Updated this week
- The TypeScript LLM Evaluation Library☆162Nov 11, 2025Updated 7 months ago
- ☆401Updated this week
- Deploy to Railway using AI coding agents - Free Credits Offer • AdUse Claude Code, Codex, OpenCode, and more. Autonomous software development now has the infrastructure to match with Railway.
- Prompt design using JSX.☆2,836Oct 15, 2025Updated 8 months ago
- structured outputs for llms☆13,210Jun 23, 2026Updated last week
- Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, De…☆22,608Updated this week
- A vitest extension for running evals.☆272Jun 22, 2026Updated last week
- The LLM Evaluation Framework☆16,516Updated this week
- Evals meant to evaluate language models' ability to reason over long contexts.☆10Sep 12, 2024Updated last year
- Python SDK for running evaluations on LLM generated responses☆300Jun 6, 2025Updated last year
- The pretty much "official" DSPy framework for Typescript☆2,787Jun 23, 2026Updated last week
- Laminar - open-source observability platform purpose-built for AI agents. YC S24.☆3,043Updated this week
- Managed Kubernetes at scale on DigitalOcean • AdDigitalOcean Kubernetes includes the control plane, bandwidth allowance, container registry, automatic updates, and more for free.
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓☆5,857Jun 11, 2026Updated 2 weeks ago
- A lightweight React Hook intended mainly for AI chat applications, for smoothly sticking to bottom of messages☆748Jun 4, 2026Updated 3 weeks ago
- OTEL ingestion running on Cloudflare Workers☆49Apr 8, 2025Updated last year
- DSPy: The framework for programming—not prompting—language models☆35,605Updated this week
- 🪢 Open source AI engineering platform: LLM evals, observability, metrics, prompt management, playground, datasets. Integrates with OpenT…☆29,792Updated this week
- The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible)☆8,437Updated this week
- A Workers AI provider for the vercel AI SDK☆113Mar 18, 2025Updated last year
- AI Observability & Evaluation☆10,254Jun 24, 2026Updated last week
- Mastra is the modern TypeScript framework for AI-powered applications and agents.☆25,558Updated this week
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- Structured Outputs☆14,273Updated this week
- The SDK For Browser Agents☆23,230Jun 24, 2026Updated last week
- The platform for LLM evaluations and AI agent testing☆3,313Updated this week
- Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.☆18,783Apr 14, 2026Updated 2 months ago
- Supercharge Your LLM Application Evaluations 🚀☆14,523Feb 24, 2026Updated 4 months ago
- ☆80Jun 5, 2024Updated 2 years ago
- The leading workflow orchestration platform. Run stateful step functions and AI workflows on serverless, servers, or the edge.☆5,530Updated this week
- Wow!☆12Oct 25, 2024Updated last year
- Developer toolkit that makes it simple to build with the Workers AI platform.☆193Jun 8, 2026Updated 3 weeks ago
- AI Agents on DigitalOcean Gradient AI Platform • AdBuild production-ready AI agents using customizable tools or access multiple LLMs through a single endpoint. Create custom knowledge bases or connect external data.
- Supercharge your local development☆370Oct 8, 2025Updated 8 months ago
- an ambient intelligence library☆6,174May 12, 2026Updated last month
- AI Hero's open-source examples and course material. Learn AI Engineering with a single repo.☆1,537Jun 20, 2026Updated last week
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆51Sep 29, 2024Updated last year
- Readymade evaluators for agent trajectories☆629Jun 17, 2026Updated last week
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a…☆51,475Updated this week
- The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered application…☆25,191Updated this week