FoundationAgents / VR-BenchLinks

We introduce Reasoning via Video, a new paradigm that uses maze-solving video generation to probe multimodal reasoning; our VR-Bench shows that fine-tuned video models consistently outperform strong VLMs on long-horizon spatial planning tasks.
46Updated this week

Alternatives and similar repositories for VR-Bench

Users that are interested in VR-Bench are comparing it to the libraries listed below

Sorting: