UKGovernmentBEIS / inspect_ai
Inspect: A framework for large language model evaluations
☆838Updated this week
Alternatives and similar repositories for inspect_ai:
Users that are interested in inspect_ai are comparing it to the libraries listed below
- A library for making RepE control vectors☆562Updated 2 months ago
- Collection of evals for Inspect AI☆101Updated this week
- Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard a…☆1,094Updated 2 months ago
- METR Task Standard☆146Updated last month
- Lighteval is your all-in-one toolkit for evaluating LLMs across multiple backends☆1,336Updated this week
- Evaluate your LLM's response with Prometheus and GPT4 💯