braintrustdata / autoevals
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
☆429Updated this week
Alternatives and similar repositories for autoevals:
Users that are interested in autoevals are comparing it to the libraries listed below
- ☆145Updated last month
- Low latency JSON generation using LLMs ⚡️☆396Updated last year
- Prompt engineering, automated.☆288Updated 3 months ago
- Python SDK for running evaluations on LLM generated responses☆272Updated this week
- ☆195Updated 10 months ago
- Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments.☆168Updated 9 months ago
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆299Updated this week
- structured extraction for llms☆691Updated last month
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆230Updated this week
- AgentKit: Build multi-agent networks in TypeScript with deterministic routing and rich tooling via MCP.☆312Updated this week
- ☆108Updated last week
- Super performant RAG pipelines for AI apps. Summarization, Retrieve/Rerank and Code Interpreters in one simple API.☆363Updated 10 months ago
- Automatically reformat any JSON into any schema with AI☆327Updated last week
- A Lightweight Library for AI Observability☆237Updated last month
- Create state-machine-powered LLM agents using XState☆260Updated last week
- ☆273Updated last week
- Python and TypeScript library for integrating the Stripe API into agentic workflows☆551Updated this week
- Routing on Random Forest (RoRF)☆135Updated 5 months ago
- llm-consortium orchestrates mulitple LLMs, iteratively refines & achieves consensus.☆178Updated last week
- smol-podcaster is your podcast production agent 🎙️☆332Updated 6 months ago
- OpenTelemetry Instrumentation for AI Observability☆334Updated this week
- A simple Python sandbox for helpful LLM data agents☆235Updated 9 months ago
- ☆398Updated 7 months ago
- Data-Driven Evaluation for LLM-Powered Applications☆484Updated last month
- 🤖 Headless IDE for AI agents☆172Updated 3 weeks ago
- Simple AI coder that can do most of my work for me, including working on himself.☆235Updated last month