FoundationAgents / VR-BenchView on GitHub
We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform strong VLMs on long-horizon spatial planning tasks.
53Feb 4, 2026Updated 3 weeks ago

Alternatives and similar repositories for VR-Bench

Users that are interested in VR-Bench are comparing it to the libraries listed below

Sorting:

Are these results useful?