AlmogBaku / pytest-evalsLinks
A pytest plugin for running and analyzing LLM evaluation tests.
☆131Updated 5 months ago
Alternatives and similar repositories for pytest-evals
Users that are interested in pytest-evals are comparing it to the libraries listed below
Sorting:
- Work with OpenAI's streaming API at ease with Python generators☆121Updated last year
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex …☆78Updated last year
- Open-source versioning, tracing, and annotation tooling.☆165Updated this week
- HyPSTER - HyperParameter optimization on STERoids☆48Updated 7 months ago
- 🪢 Langfuse Python SDK - Instrument your LLM app with decorators or low-level SDK and get detailed tracing/observability. Works with any …☆216Updated this week
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack☆160Updated this week
- LLM prompt language based on Jinja. Banks provides tools and functions to build prompts text and chat messages from generic blueprints. I…☆108Updated this week
- A small library of LLM judges☆232Updated 3 weeks ago
- ☆74Updated 8 months ago
- 🦄 ai that works - every tuesday 10 AM PST☆166Updated this week
- A Lightweight Library for AI Observability☆246Updated 4 months ago
- Python SDK for running evaluations on LLM generated responses☆289Updated last month
- Using various instructor clients evaluating the quality and capabilities of extractions and reasoning.☆52Updated 9 months ago
- ☆71Updated 8 months ago
- ☆151Updated this week
- Pydantic extension for annotating autocorrecting fields.☆222Updated last year
- ☆10Updated 10 months ago
- Build super simple end-to-end data & ETL pipelines for your vector databases and Generative AI applications☆98Updated 9 months ago
- OpenTelemetry Instrumentation for AI Observability☆503Updated this week
- Transform your pythonic research to an artifact that engineers can deploy easily.☆154Updated last month
- A WhatsApp bot that can participate in group conversations, powered by AI. The bot monitors group messages and responds when mentioned.☆89Updated this week
- Named Entity Recognition using Claude Citations☆77Updated last month
- syftr is an agent optimizer that helps you find the best agentic workflows for your budget.☆284Updated last week
- ☆168Updated last year
- An AI extension for IPython that makes it work like Cursor☆67Updated 6 months ago
- Claudette is Claude's friend☆247Updated this week
- Convert an AI Agent into a A2A server! ✨☆72Updated this week
- ☆76Updated last week
- Kura is a simple reproduction of the CLIO paper which uses language models to label user behaviour before clustering them based on embedd…☆251Updated 2 weeks ago
- A python implementation of priompt - a neat way of managing context from diverse sources for LLM applications.☆112Updated last week