nttmdlab-nlp/ToMATO


ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)

☆20

Alternatives and similar repositories for ToMATO

Users who are interested in ToMATO are comparing it to the libraries listed below.

shawnsihyunlee / simulatedtom
Public repository for "Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities".
☆25 · Aug 16, 2023 · Updated 2 years ago
seacowx / OpenToM
The official repository of the OpenToM dataset
☆33 · Feb 2, 2025 · Updated last year
cicl-stanford / procedural-evals-tom
☆40 · Jul 16, 2023 · Updated 3 years ago
zhchen18 / ToMBench
ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.
☆68 · Jun 24, 2024 · Updated 2 years ago
abwilf / Social-IQ-2.0-Challenge
The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23
☆38 · Oct 13, 2023 · Updated 2 years ago
kayburns / tom-qa-dataset
☆24 · Oct 31, 2018 · Updated 7 years ago
noiseQA / NoiseQA
☆12 · Feb 22, 2021 · Updated 5 years ago
aiishii / JEMHopQA
☆30 · Apr 10, 2025 · Updated last year
allenai / faithful-nmn
Evaluating and improving the faithfulness of the interpretations offered by Neural Module Networks
☆13 · Jun 12, 2023 · Updated 3 years ago
iesl / CE2ERE
Constrained learning using boxes for event-event relation extraction
☆12 · Aug 5, 2022 · Updated 3 years ago
scottclowe / pytorch-experiment-template
☆15 · Updated this week
eujhwang / vn-analysis
virtual node analysis on ogb benchmark dataset
☆14 · Mar 9, 2023 · Updated 3 years ago
chuanyangjin / MMToM-QA
[🏆Outstanding Paper Award at ACL 2024] MMToM-QA: Multimodal Theory of Mind Question Answering
☆159 · Jun 28, 2026 · Updated 3 weeks ago
julianje / Bishop
Mental state inference from observable behavior
☆15 · Dec 3, 2021 · Updated 4 years ago
dayoon-ko / ExFunTube
The source code of ExFunTube
☆10 · Aug 8, 2025 · Updated 11 months ago
SCAI-JHU / MuMA-ToM
[AAAI 2025 𝐎𝐫𝐚𝐥] MuMA-ToM: Multi-modal Multi-Agent Theory of Mind
☆41 · Jun 28, 2026 · Updated 3 weeks ago
double125 / Graph-Matching-Attention
Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering
☆11 · Feb 16, 2023 · Updated 3 years ago
K-Kuyama / yet-another-UI-for-AW
UI for ActivityWatch. Include category editor and viewer for multiple categorizations.
☆10 · Jan 31, 2024 · Updated 2 years ago
keiji / region_cropper
Help creating image dataset for machine learning.
☆10 · Nov 4, 2020 · Updated 5 years ago
yunshiuan / tomnet-project
This repo contains the ToMnet+ model for preference inference. Developed by Yun-Shiuan, Edwinn, Hsin-Yi, and Elaine.
☆10 · Feb 24, 2023 · Updated 3 years ago
HAE-RAE / HAERAE-VISION
Evaluation code for HAERAE-Vision benchmark
☆15 · Apr 29, 2026 · Updated 2 months ago
Yebin46 / FLEUR
[ACL 2024] FLEUR: An Explainable Reference-Free Evaluation Metric for Image Captioning Using a Large Multimodal Model
☆17 · Apr 28, 2025 · Updated last year
XiangLi1999 / AutoBencher
☆33 · Jul 11, 2024 · Updated 2 years ago
alecwangcq / f-divergence-dpo
Direct preference optimization with f-divergences.
☆17 · Nov 3, 2024 · Updated last year
scloudyy / Defogging
A python package of robust and effective defogging/dehazing method
☆15 · Dec 30, 2018 · Updated 7 years ago
marcotcr / qa_consistency
Evaluate QA models for consistency
☆20 · Nov 21, 2022 · Updated 3 years ago
orcax / LOGER
Faithfully Explainable Recommendation via Neural Logic Reasoning
☆16 · May 3, 2021 · Updated 5 years ago
jingchenchen / ReasoningConsistency-VQA
☆13 · Aug 14, 2022 · Updated 3 years ago
a01sa01to / TitleAndURL_Picker
Chrome Extension. As the name suggests.
☆10 · Jan 30, 2022 · Updated 4 years ago
MSEDdataset / MSED
☆11 · Dec 22, 2021 · Updated 4 years ago
giovannicoppola / alfred-yaanki
yet another anki app
☆14 · Sep 9, 2024 · Updated last year
hyounghk / CoSIm
Code and dataset for NAACL 2022 paper "CoSIm: Commonsense Reasoning for Counterfactual Scene Imagination" Hyounghun Kim, Abhay Zala, Mohi…
☆16 · Nov 26, 2022 · Updated 3 years ago
CLAW-Lab / ToM
Code accompanying ICML 2021 paper "Few-shot Language Coordination by Modeling Theory of Mind"
☆18 · May 18, 2022 · Updated 4 years ago
jlparkI / mix_T
Python (pip) package for fitting mixtures of Student's t-distributions using either maximum likelihood (EM) or Bayesian methodology (vari…
☆11 · Sep 23, 2025 · Updated 10 months ago
Geralt-Targaryen / MC-Evaluation
☆14 · May 21, 2024 · Updated 2 years ago
lbox-kr / kbl
Korean Benchmark for Korean Legal Language Understanding
☆19 · Nov 16, 2024 · Updated last year
lyveng / pandas-hbase
Pandas Helper Library for reading and writing DataFrames from and to HBase.
☆10 · Mar 8, 2018 · Updated 8 years ago
VimalWill / Vstream
Vstream - Video Analytics pipeline with Hardware based accelerations (dev - stage)
☆10 · Feb 2, 2024 · Updated 2 years ago
eumesy / wrd
Word Rotator's Distance
☆18 · Sep 5, 2021 · Updated 4 years ago