Orolol / familyBenchView on GitHub
FamilyBench evaluation tool for testing the relational reasoning capabilities of Large Language Models (LLMs).
43Oct 6, 2025Updated 5 months ago

Alternatives and similar repositories for familyBench

Users that are interested in familyBench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?