braintrustdata / autoevalsLinks
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
☆712Updated last week
Alternatives and similar repositories for autoevals
Users that are interested in autoevals are comparing it to the libraries listed below
Sorting:
- structured extraction for llms☆758Updated 9 months ago
- ☆101Updated last week
- Prompt engineering, automated.☆348Updated 6 months ago
- ☆155Updated 5 months ago
- Create state-machine-powered LLM agents using XState☆320Updated 5 months ago
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry☆362Updated last week
- Low latency JSON generation using LLMs ⚡️☆397Updated last year
- AgentKit: Build multi-agent networks in TypeScript with deterministic routing and rich tooling via MCP.☆688Updated last week
- The TypeScript LLM Evaluation Library☆148Updated this week
- Get structured, fully typed, and validated JSON outputs from OpenAI and Anthropic models.☆628Updated last year
- A fuzzy key value store based on semantic similarity rather lexical equality.☆286Updated 11 months ago
- Python SDK for running evaluations on LLM generated responses☆293Updated 5 months ago
- The pretty much "official" DSPy framework for Typescript☆2,247Updated last week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆362Updated 2 months ago
- The toolkit for AI devtools context engineering. Build with codebase mapping, symbol extraction, and many kinds of code search.☆640Updated 2 weeks ago
- Evaluate your LLM-powered apps with TypeScript☆1,139Updated this week
- Optimize prompts, code, and more with AI-powered Reflective Text Evolution☆1,526Updated this week
- Add natural language control to your React app, with MCP and generative UX☆810Updated this week
- ☆379Updated this week
- Reasoning Augmented Generation☆889Updated 4 months ago
- Chat with your PostHog data☆161Updated last year
- ☆192Updated last month
- Compose data structures, serialize them to prompts.☆68Updated 4 months ago
- Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments.☆169Updated last year
- Automatically reformat any JSON into any schema with AI☆338Updated 8 months ago
- Easily spin up an MCP Server on Next.js, Nuxt, Svelte, and more☆472Updated last month
- ☆43Updated last week
- 🦛 CHONK your texts with Chonkie ✨ Type-friendly, light-weight, fast and super-simple chunking library☆291Updated 3 weeks ago
- Write programs you can talk to.☆378Updated last year
- Comprehensive Vector Data Tooling. The universal interface for all vector database, datasets and RAG platforms. Easily export, import, ba…☆264Updated last week