lishangyu-hkust / OSVBench
(AAAI 2026) OSVBench is a benchmark for evaluating Large Language Models (LLMs) on generating complete specification code for operating system kernel verification tasks.
13 | May 13, 2025 | Updated 10 months ago

Alternatives and similar repositories for OSVBench

Users that are interested in OSVBench are comparing it to the libraries listed below
