agentrebench / AgentRE-Bench
AgentRE-Bench is an agentic benchmark that evaluates state-of-the-art models on long-horizon reverse engineering tasks, measuring their ability to analyze binaries, use tooling effectively, and reason over multi-step execution artifacts.
58 | May 14, 2026 | Updated this week

Alternatives and similar repositories for AgentRE-Bench

Users that are interested in AgentRE-Bench are comparing it to the libraries listed below.

