FoundationAgents / VR-Bench
We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform strong VLMs on long-horizon spatial planning tasks.
56 | Feb 4, 2026 | Updated last month

Alternatives and similar repositories for VR-Bench

Users who are interested in VR-Bench are comparing it to the libraries listed below.
