cambridgeltl / visual-spatial-reasoning

[TACL'23] VSR: A probing benchmark for spatial undersranding of vision-language models.
104Updated last year

Related projects

Alternatives and complementary repositories for visual-spatial-reasoning