facebookresearch / EgoToMView on GitHub
EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large language models' ability to infer a camera wearer's goals, in-the-moment belief states, and future actions.
13Apr 1, 2025Updated 11 months ago

Alternatives and similar repositories for EgoToM

Users that are interested in EgoToM are comparing it to the libraries listed below

Sorting:

Are these results useful?