braintrustdata / autoevalsLinks
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
☆553Updated 2 weeks ago
Alternatives and similar repositories for autoevals
Users that are interested in autoevals are comparing it to the libraries listed below
Sorting:
- structured extraction for llms☆730Updated 5 months ago
- ☆152Updated 3 weeks ago
- Create state-machine-powered LLM agents using XState☆298Updated last month
- Low latency JSON generation using LLMs ⚡️☆400Updated last year
- ☆77Updated this week
- Prompt engineering, automated.☆331Updated 2 months ago
- Get structured, fully typed, and validated JSON outputs from OpenAI and Anthropic models.☆625Updated last year
- A fuzzy key value store based on semantic similarity rather lexical equality.☆277Updated 7 months ago
- AgentKit: Build multi-agent networks in TypeScript with deterministic routing and rich tooling via MCP.☆523Updated last week
- Add React components to your AI assistant, copilot, or agent.☆461Updated this week
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆333Updated this week
- The toolkit for AI devtools context engineering. Build with codebase mapping, symbol extraction, and many kinds of code search.☆545Updated this week
- Python SDK for running evaluations on LLM generated responses☆289Updated last month
- Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments.☆170Updated last year
- SDK for UI over MCP. Create next-gen UI experiences!☆399Updated last week
- Chat with your PostHog data☆161Updated last year
- ☆150Updated this week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆245Updated last week
- ☆35Updated 2 weeks ago
- Automatically reformat any JSON into any schema with AI☆333Updated 3 months ago
- Simple AI coder that can do most of my work for me, including working on himself.☆246Updated 3 months ago
- The pretty much "official" DSPy framework for Typescript☆1,634Updated this week
- A Markdown-like syntax for writing prompts. Includes an in-editor playground.☆121Updated last year
- A fully customizable and self-hosted sandboxing solution for AI agent code execution and computer use. It features out-of-the-box support…☆510Updated last month
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆248Updated this week
- ☆359Updated this week
- The fastest, lightest, and easiest-to-integrate AI gateway on the market. Fully open-sourced.☆333Updated this week
- smol-podcaster is your podcast production agent 🎙️☆348Updated 9 months ago
- ☆169Updated last year
- TypeScript client for OpenAI's realtime voice API.☆349Updated 8 months ago