braintrustdata / autoevals
AutoEvals is a tool for quickly and easily evaluating AI model outputs using best practices.
☆618 Updated last week
Alternatives and similar repositories for autoevals
Users who are interested in autoevals are comparing it to the libraries listed below.
- structured extraction for llms ☆753 Updated 7 months ago
- Create state-machine-powered LLM agents using XState ☆310 Updated 3 months ago
- ☆87 Updated this week
- AgentKit: Build multi-agent networks in TypeScript with deterministic routing and rich tooling via MCP. ☆592 Updated this week
- Get structured, fully typed, and validated JSON outputs from OpenAI and Anthropic models. ☆625 Updated last year
- ☆154 Updated 3 months ago
- Prompt engineering, automated. ☆340 Updated 4 months ago
- Low latency JSON generation using LLMs ⚡️ ☆399 Updated last year
- The pretty much "official" DSPy framework for Typescript ☆1,955 Updated this week
- Fully typed & consistent chat APIs for OpenAI, Anthropic, Groq, and Azure's chat models for browser, edge, and node environments. ☆169 Updated last year
- Sister project to OpenLLMetry, but in Typescript. Open-source observability for your LLM application, based on OpenTelemetry ☆356 Updated this week
- Chat with your PostHog data ☆161 Updated last year
- ☆178 Updated this week
- The toolkit for AI devtools context engineering. Build with codebase mapping, symbol extraction, and many kinds of code search. ☆612 Updated this week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd… ☆322 Updated last week
- Compose data structures, serialize them to prompts. ☆67 Updated 2 months ago
- Python SDK for running evaluations on LLM generated responses ☆292 Updated 3 months ago
- Optimize prompts, code, and more with AI-powered Reflective Text Evolution ☆580 Updated last week
- Add generative UI components to your AI assistant, copilot, or agent. ☆667 Updated this week
- Automatically reformat any JSON into any schema with AI ☆335 Updated 6 months ago
- ☆38 Updated last month
- TypeScript client for OpenAI's realtime voice API. ☆348 Updated last month
- Provider-agnostic, open-source evaluation infrastructure for language models ☆531 Updated this week
- Evaluate your LLM-powered apps with TypeScript ☆859 Updated 2 weeks ago
- Write programs you can talk to. ☆376 Updated last year
- Reasoning Augmented Generation ☆879 Updated 2 months ago
- ☆369 Updated this week
- smol-podcaster is your podcast production agent 🎙️ ☆351 Updated last month
- Easily spin up an MCP Server on Next.js, Nuxt, Svelte, and more ☆369 Updated last week
- ☆167 Updated this week