AlmogBaku / pytest-evals
A pytest plugin for running and analyzing LLM evaluation tests.
☆116Updated last month
Alternatives and similar repositories for pytest-evals:
Users that are interested in pytest-evals are comparing it to the libraries listed below
- Work with OpenAI's streaming API at ease with Python generators☆121Updated 10 months ago
- Transform your pythonic research to an artifact that engineers can deploy easily.☆151Updated last week
- HyPSTER - HyperParameter optimization on STERoids☆47Updated 4 months ago
- ☆10Updated 7 months ago
- ☆12Updated 3 weeks ago
- ☆13Updated this week
- Python library that allows you to get structured responses in the form of Pydantic models and Python types from Anthropic, Google Vertex …☆78Updated 8 months ago
- 🪢 Langfuse Python SDK - Instrument your LLM app with decorators or low-level SDK and get detailed tracing/observability. Works with any …☆151Updated this week
- An agentic company research tool powered by LangGraph and Tavily that conducts deep diligence on companies using a multi-agent framework.…☆40Updated this week
- OpenTelemetry Instrumentation for AI Observability☆349Updated this week
- Record your service operations in production and replay them locally at any time in a sandbox☆105Updated 2 months ago
- Deploy Haystack pipelines behind a REST Api.☆61Updated this week
- Metafeature Extraction for Unstructured Data☆101Updated 2 weeks ago
- Synthetic Data SDK ✨☆351Updated this week
- This project implements the "Modular RAG" framework using Haystack & Hypster☆31Updated 4 months ago
- LLM Security Platform.☆10Updated 5 months ago
- A lightweight tool that lets you simply build prompts and get Pydantic objects as outputs☆18Updated 10 months ago
- Named Entity Recognition using Claude Citations☆64Updated 2 weeks ago
- Curated list of tools and frameworks assisting in monitoring data quality☆12Updated 2 years ago
- Additional packages (components, document stores and the likes) to extend the capabilities of Haystack version 2.0 and onwards☆139Updated this week
- A tiny LLM Agent with minimal dependencies, focused on local inference.☆52Updated 5 months ago
- A Pythonic integration for LLMs.☆88Updated last year
- A Lightweight Library for AI Observability☆238Updated last month
- Promptimize is a prompt engineering evaluation and testing toolkit.☆456Updated last month
- Hebrew oriented NER spaCy pipeline☆15Updated 7 months ago
- An organizational AI system to build a suite of AI assistants leveraging ontologies as a unifying field that connect data, AI models, wor…☆56Updated this week
- 🧪 Experimental features for Haystack☆41Updated this week
- An AI extension for IPython that makes it work like Cursor☆63Updated 2 months ago
- Inspect repository data, including countries and organizations of stargazers and forkers.☆38Updated last year
- Product analytics for AI Assistants☆149Updated last week