agentrebench / AgentRE-BenchView on GitHub
AgentRE-Bench is an agentic benchmark that evaluates state-of-the-art models on long-horizon reverse engineering tasks, measuring their ability to analyze binaries, use tooling effectively, and reason over multi-step execution artifacts
51Mar 3, 2026Updated last month

Alternatives and similar repositories for AgentRE-Bench

Users that are interested in AgentRE-Bench are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.

Sorting:

Are these results useful?