JingbiaoMei / ATM-Bench
ATM-Bench: A benchmark for long-term personalized memory QA spanning ~4 years of multimodal data (images, videos, emails). Features referential queries, evidence-grounded answering, and multi-source reasoning. Paper: "According to Me: Long-Term Personalized Referential Memory QA"
39 · Apr 10, 2026 · Updated 2 weeks ago

Alternatives and similar repositories for ATM-Bench

Users that are interested in ATM-Bench are comparing it to the libraries listed below.
