AlmogBaku / pytest-evals
A pytest plugin for running and analyzing LLM evaluation tests.
☆144Updated 9 months ago
Alternatives and similar repositories for pytest-evals
Users who are interested in pytest-evals are comparing it to the libraries listed below.
- Pydantic extension for annotating autocorrecting fields.☆222Updated last year
- Convert an AI Agent into an A2A server! ✨☆138Updated last month
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex …☆79Updated 2 months ago
- Calculate prices for calling LLM inference APIs.☆156Updated this week
- Python browser sandbox.☆182Updated 7 months ago
- ☆120Updated this week
- ☆77Updated 8 months ago
- Promptimize is a prompt engineering evaluation and testing toolkit.☆485Updated 2 weeks ago
- The Logfire MCP Server is here!☆127Updated 2 months ago
- OpenTelemetry Instrumentation for AI Observability☆740Updated this week
- An AI extension for IPython that makes it work like Cursor☆69Updated 10 months ago
- HyPSTER - Configuration Framework for Optimizing AI & AI Systems☆55Updated 2 months ago
- 🪢 Langfuse Python SDK - Instrument your LLM app with decorators or low-level SDK and get detailed tracing/observability. Works with any …☆298Updated this week
- ☆84Updated last year
- ☆252Updated this week
- A Lightweight Library for AI Observability☆251Updated 9 months ago
- Synchronicity lets you interoperate with asynchronous Python APIs.☆128Updated last week
- RAG orchestration framework ⛵️☆201Updated 4 months ago
- Work with OpenAI's streaming API at ease with Python generators☆122Updated last year
- Open-source versioning, tracing, and annotation tooling.☆205Updated 3 weeks ago
- SUQL: Conversational Search over Structured and Unstructured Data with LLMs☆291Updated last month
- Python port of part of the TypeAgent repo☆351Updated this week
- Jambo - JSON Schema to Pydantic Converter☆68Updated this week
- Easily deploy Haystack pipelines as REST APIs and MCP Tools.☆127Updated this week
- A small library of LLM judges☆302Updated 3 months ago
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere☆152Updated this week
- Build reliable AI and agentic applications with DataFrames☆407Updated this week
- ☆197Updated last week
- 🌱 Substrate is a modern Copier template for scaffolding Python packages and apps☆335Updated 2 months ago
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆117Updated 4 months ago