AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
☆869 · Apr 3, 2026 · Updated 3 weeks ago
Alternatives and similar repositories for autoevals
Users that are interested in autoevals are comparing it to the libraries listed below.
- Evaluate your LLM-powered apps with TypeScript ☆1,463 · Mar 27, 2026 · Updated last month
- ☆56 · Apr 2, 2026 · Updated 3 weeks ago
- JavaScript Tracing & Evals library for Braintrust ☆14 · Updated this week
- The TypeScript LLM Evaluation Library ☆156 · Nov 11, 2025 · Updated 5 months ago
- ☆392 · Updated this week
- Prompt design using JSX. ☆2,787 · Oct 15, 2025 · Updated 6 months ago
- structured outputs for llms ☆12,840 · Apr 22, 2026 · Updated last week
- A vitest extension for running evals. ☆150 · Apr 20, 2026 · Updated last week
- Test your prompts, agents, and RAGs. Red teaming/pentesting/vulnerability scanning for AI. Compare performance of GPT, Claude, Gemini, Ll… ☆20,722 · Updated this week
- The LLM Evaluation Framework ☆14,993 · Updated this week
- Evals meant to evaluate language models' ability to reason over long contexts. ☆10 · Sep 12, 2024 · Updated last year
- Python SDK for running evaluations on LLM generated responses ☆299 · Jun 6, 2025 · Updated 10 months ago
- The pretty much "official" DSPy framework for Typescript ☆2,608 · Updated this week
- 🧊 Open source LLM observability platform. One line of code to monitor, evaluate, and experiment. YC W23 🍓 ☆5,576 · Apr 23, 2026 · Updated last week
- Laminar - open-source observability platform purpose-built for AI agents. YC S24. ☆2,803 · Updated this week
- OTEL ingestion running on Cloudflare Workers ☆49 · Apr 8, 2025 · Updated last year
- A lightweight React Hook intended mainly for AI chat applications, for smoothly sticking to bottom of messages ☆723 · Apr 3, 2026 · Updated 3 weeks ago
- DSPy: The framework for programming—not prompting—language models ☆34,016 · Apr 24, 2026 · Updated last week
- A Workers AI provider for the vercel AI SDK ☆115 · Mar 18, 2025 · Updated last year
- 🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with Open… ☆26,353 · Updated this week
- The AI framework that adds the engineering to prompt engineering (Python/TS/Ruby/Java/C#/Rust/Go compatible) ☆8,061 · Updated this week
- AI Observability & Evaluation ☆9,459 · Updated this week
- From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack. ☆23,351 · Updated this week
- Structured Outputs ☆13,741 · Apr 16, 2026 · Updated 2 weeks ago
- The SDK For Browser Agents ☆22,371 · Updated this week
- Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks. ☆18,270 · Apr 14, 2026 · Updated 2 weeks ago
- Supercharge Your LLM Application Evaluations 🚀 ☆13,709 · Feb 24, 2026 · Updated 2 months ago
- The platform for LLM evaluations and AI agent testing ☆3,231 · Updated this week
- ☆80 · Jun 5, 2024 · Updated last year
- Wow! ☆12 · Oct 25, 2024 · Updated last year
- Developer toolkit that makes it simple to build with the Workers AI platform. ☆186 · Apr 23, 2026 · Updated last week
- The leading workflow orchestration platform. Run stateful step functions and AI workflows on serverless, servers, or the edge. ☆5,282 · Updated this week
- Supercharge your local development ☆369 · Oct 8, 2025 · Updated 6 months ago
- an ambient intelligence library ☆6,144 · Apr 22, 2026 · Updated last week
- AI Hero's open-source examples and course material. Learn AI Engineering with a single repo. ☆1,408 · Jul 22, 2025 · Updated 9 months ago
- Readymade evaluators for agent trajectories ☆570 · Apr 21, 2026 · Updated last week
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning. ☆51 · Sep 29, 2024 · Updated last year
- Python SDK, Proxy Server (AI Gateway) to call 100+ LLM APIs in OpenAI (or native) format, with cost tracking, guardrails, loadbalancing a… ☆45,153 · Updated this week
- The AI Toolkit for TypeScript. From the creators of Next.js, the AI SDK is a free open-source library for building AI-powered application… ☆23,777 · Apr 24, 2026 · Updated last week