zli12321 / qa_metrics
An easy-to-use Python package for running quick, basic QA evaluations. It includes standardized QA evaluation metrics and semantic evaluation metrics: black-box and open-source large language model prompting and evaluation, exact match, F1 score, PEDANT semantic match, and transformer match. The package also supports prompting the OpenAI and Anthropic APIs.
61 · Jul 18, 2025 · Updated 7 months ago

Alternatives and similar repositories for qa_metrics

Users who are interested in qa_metrics are comparing it to the libraries listed below
