VibeBench / VibeSearchBench
🔍 The hardest search benchmark in the wild — vague, multi-turn, proactive. 200 long-horizon tasks with persona-driven progressive disclosure, scored by verifiable schema-free knowledge-graph evaluation. No vibes, just triplet F1.
479 · May 28, 2026 · Updated last week
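
The description names triplet F1 as the scoring metric. The following is a minimal sketch of how such a score could be computed, assuming predictions and references are sets of (subject, relation, object) triplets compared by exact match after lowercasing and whitespace normalization. The function names `normalize` and `triplet_f1` and the matching rule are illustrative assumptions, not the benchmark's actual implementation, which may use a more lenient matching scheme.

```python
from typing import Iterable, Tuple

Triplet = Tuple[str, str, str]


def normalize(triplet: Triplet) -> Triplet:
    # Assumed normalization: lowercase and collapse whitespace in each part.
    return tuple(" ".join(part.lower().split()) for part in triplet)


def triplet_f1(predicted: Iterable[Triplet], gold: Iterable[Triplet]) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) over exact-match normalized triplets."""
    pred = {normalize(t) for t in predicted}
    ref = {normalize(t) for t in gold}
    true_positives = len(pred & ref)
    # In this sketch an empty set yields 0.0 rather than raising an error.
    precision = true_positives / len(pred) if pred else 0.0
    recall = true_positives / len(ref) if ref else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1


predicted = [
    ("Ada Lovelace", "born_in", "London"),
    ("Ada Lovelace", "field", "Chemistry"),
]
gold = [
    ("ada lovelace", "born_in", "london"),
    ("Ada Lovelace", "field", "Mathematics"),
]
precision, recall, f1 = triplet_f1(predicted, gold)
print(precision, recall, f1)
```

Here one of the two predicted triplets matches one of the two reference triplets, so precision, recall, and F1 all come out to 0.5.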

Alternatives and similar repositories for VibeSearchBench

Users who are interested in VibeSearchBench are comparing it to the libraries listed below.
