AlmogBaku / pytest-evalsLinks
A pytest plugin for running and analyzing LLM evaluation tests.
☆142Updated 8 months ago
Alternatives and similar repositories for pytest-evals
Users that are interested in pytest-evals are comparing it to the libraries listed below
Sorting:
- Pydantic extension for annotating autocorrecting fields.☆221Updated last year
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex …☆78Updated last month
- ☆76Updated 6 months ago
- Python browser sandbox.☆179Updated 6 months ago
- Python SDK for Inngest: Durable functions and workflows in Python, hosted anywhere☆141Updated this week
- Convert an AI Agent into a A2A server! ✨☆123Updated last week
- ☆82Updated 11 months ago
- Calculate prices for calling LLM inference APIs.☆113Updated this week
- Promptimize is a prompt engineering evaluation and testing toolkit.☆480Updated last week
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆116Updated 2 months ago
- Work with OpenAI's streaming API at ease with Python generators☆122Updated last year
- A Lightweight Library for AI Observability☆251Updated 7 months ago
- Run evals using LLM☆26Updated last year
- A bit of extra usability for sqlite☆214Updated 3 months ago
- Synchronicity lets you interoperate with asynchronous Python APIs.☆124Updated 2 months ago
- OpenTelemetry Instrumentation for AI Observability☆651Updated this week
- HyPSTER - Configuration Framework for Optimizing AI & AI Systems☆53Updated last month
- A pattern to let you try several vector databases and change a little code as possible☆38Updated 2 years ago
- An AI extension for IPython that makes it work like Cursor☆68Updated 9 months ago
- RAG orchestration framework ⛵️☆201Updated 3 months ago
- The Logfire MCP Server is here!☆114Updated 3 weeks ago
- ☆227Updated last month
- Plug-and-play, zero-shot document processing pipelines.☆107Updated this week
- Build reliable, secure, and production-ready AI apps easily.☆85Updated last week
- Claudette is Claude's friend☆279Updated last month
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆52Updated last year
- A small library of LLM judges☆294Updated 2 months ago
- Constrain LLM output☆113Updated last year
- An end-to-end LLM reference implementation providing a Q&A interface for Airflow and Astronomer☆268Updated 3 months ago
- The faststream-gen library uses advanced AI to generate FastStream code from user descriptions, speeding up FastStream app development.☆48Updated last year