MuMA-ToM: Multi-modal Multi-Agent Theory of Mind
☆38Jan 23, 2025Updated last year
Alternatives and similar repositories for MuMA-ToM
Users that are interested in MuMA-ToM are comparing it to the libraries listed below
Sorting:
- ☆16Oct 11, 2025Updated 4 months ago
- ☆22Nov 8, 2023Updated 2 years ago
- AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling☆39Dec 26, 2025Updated 2 months ago
- ☆37Jul 16, 2023Updated 2 years ago
- The Social-IQ 2.0 Challenge Release for the Artificial Social Intelligence Workshop at ICCV '23☆36Oct 13, 2023Updated 2 years ago
- ☆12May 6, 2024Updated last year
- ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)☆19Apr 16, 2025Updated 10 months ago
- [NeurIPS D&B Track 2024] Source code for the paper "Constrained Human-AI Cooperation: An Inclusive Embodied Social Intelligence Challenge…☆23May 2, 2025Updated 9 months ago
- Social-AI papers across computing communities, courses, and dissertations.☆21Jun 10, 2025Updated 8 months ago
- Source code for our paper: "ARIA: Training Language Agents with Intention-Driven Reward Aggregation".☆26Aug 9, 2025Updated 6 months ago
- Code for ExploreTom☆91Jun 25, 2025Updated 8 months ago
- Official code for our EMNLP2021 Outstanding Paper MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks☆21May 18, 2023Updated 2 years ago
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.☆64Jun 24, 2024Updated last year
- DCPO: Dynamic Adaptive Clipping for RL☆45Dec 20, 2025Updated 2 months ago
- ☆60Jan 12, 2026Updated last month
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆59May 31, 2024Updated last year
- The first spoken long-text dataset derived from live streams, designed to reflect the redundancy-rich and conversational nature of real-w…☆12Jun 28, 2025Updated 8 months ago
- A Text2SQL benchmark for evaluation of Large Language Models☆41Updated this week
- [NeurIPS 2024] Official Repository of Multi-Object Hallucination in Vision-Language Models☆34Nov 13, 2024Updated last year
- ☆30Sep 28, 2023Updated 2 years ago
- ☆46Sep 27, 2025Updated 5 months ago
- ☆18Jun 10, 2025Updated 8 months ago
- [NeurIPS ENLSP Workshop'24] CSKV: Training-Efficient Channel Shrinking for KV Cache in Long-Context Scenarios☆16Oct 18, 2024Updated last year
- (ICLR 2026) Optimas: Optimizing Compound AI Systems☆76Feb 6, 2026Updated 3 weeks ago
- [ACM MM2025] The official repository for the RealSyn dataset☆40Dec 14, 2025Updated 2 months ago
- ☆63Jul 11, 2025Updated 7 months ago
- Hypothetical Minds is an autonomous LLM-based agent for diverse multi-agent settings, integrating a Theory of Mind module Theory of Mind …☆39Jul 13, 2024Updated last year
- ☆11Jun 22, 2025Updated 8 months ago
- Symphony — A decentralized multi-agent framework that enables intelligent agents to collaborate seamlessly across heterogeneous edge devi…☆30Oct 30, 2025Updated 4 months ago
- [ICLR 2026] ParallelBench: Understanding the Tradeoffs of Parallel Decoding in Diffusion LLMs☆30Updated this week
- The official implement of paper 《DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone Agents》☆29Oct 23, 2025Updated 4 months ago
- A Framework for Evaluating AI Agent Safety in Realistic Environments☆30Oct 2, 2025Updated 5 months ago
- [ICML 2024] Official Repository for the paper "Transformers Get Stable: An End-to-End Signal Propagation Theory for Language Models"☆10Jul 19, 2024Updated last year
- Bilateral Cross-Modality Graph Matching Attention for Feature Fusion in Visual Question Answering☆11Feb 16, 2023Updated 3 years ago
- mReasoner is a unified computational implementation of the model theory of thinking and reasoning☆13Aug 17, 2023Updated 2 years ago
- A toolkit for evaluation of natural language generation (NLG), including BLEU, ROUGE, METEOR, and CIDEr.☆33Sep 4, 2020Updated 5 years ago
- ☆38Feb 6, 2025Updated last year
- ☆45Dec 1, 2025Updated 3 months ago
- ☆36Mar 20, 2017Updated 8 years ago