Public repository for "Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities".
☆23Aug 16, 2023Updated 2 years ago
Alternatives and similar repositories for simulatedtom
Users that are interested in simulatedtom are comparing it to the libraries listed below.
- The official repository of the OpenToM dataset☆29Feb 2, 2025Updated last year
- ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)☆19Apr 16, 2025Updated 11 months ago
- Answering Ambiguous Questions via Iterative Prompting☆14May 25, 2024Updated last year
- ☆16Oct 11, 2025Updated 5 months ago
- Code accompanying our EMNLP 2019 paper: "Revisiting the Evaluation of Theory of Mind through Question Answering"☆26Aug 9, 2020Updated 5 years ago
- The Implementation of "Machine Theory of Mind", ICML 2018☆27Mar 14, 2022Updated 4 years ago
- ☆20Jun 4, 2025Updated 9 months ago
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆35Sep 26, 2024Updated last year
- ☆15Oct 23, 2023Updated 2 years ago
- (CVPR 2024) FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning☆20Jun 21, 2024Updated last year
- Social-AI papers across computing communities, courses, and dissertations.☆21Jun 10, 2025Updated 9 months ago
- LMTuner: Make the LLM Better for Everyone☆38Sep 21, 2023Updated 2 years ago
- Tree-of-Debate converts scientific papers into LLM personas that debate their respective novelties. To emphasize structured, critical rea…☆18Jul 22, 2025Updated 8 months ago
- Code for the paper "Implicit Representations of Meaning in Neural Language Models"☆56Feb 14, 2023Updated 3 years ago
- [ACL 2024] Benchmarking Knowledge Boundary for Large Language Models: A Different Perspective on Model Evaluation☆10May 26, 2024Updated last year
- The implement of LLMTreeRec☆14Dec 9, 2024Updated last year
- ☆31Aug 21, 2023Updated 2 years ago
- An implementation of Etcetera Abduction in Python☆11Nov 12, 2025Updated 4 months ago
- ☆59Dec 6, 2024Updated last year
- Simple phoenix setup for padded window management☆13Apr 25, 2018Updated 7 years ago
- Mental state inference from observable behavior☆15Dec 3, 2021Updated 4 years ago
- [ICRA 2025] RACER: Rich Language-Guided Failure Recovery Policies for Imitation Learning☆42Oct 10, 2024Updated last year
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆59May 31, 2024Updated last year
- Discussion Notes, SU2020 VE280: Programming and Data Structure, SJTU.☆13Dec 27, 2020Updated 5 years ago
- Multi-modality Hierarchical Recall based on GBDTs for Bipolar Disorder Classification☆10Jul 12, 2023Updated 2 years ago
- Code for "Multilingual language models predict human reading behavior"☆12Oct 9, 2022Updated 3 years ago
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024.☆68Jun 24, 2024Updated last year
- Promoting critical thinking through machine-generated prompts.☆19Sep 21, 2021Updated 4 years ago
- ☆13Mar 15, 2022Updated 4 years ago
- Source code for "An Empirical Study of Code Smells in Transformer-based Code Generation Techniques".☆11Oct 4, 2022Updated 3 years ago
- Find context neurons in Pythia models.☆13Jun 13, 2023Updated 2 years ago
- ☆18Nov 13, 2024Updated last year
- P2P Social network☆16May 25, 2015Updated 10 years ago
- [CVPR 2026] WorldMM: Dynamic Multimodal Memory Agent for Long Video Reasoning☆63Updated this week
- Language Models for Code Completion: a Practical Evaluation☆13Jan 19, 2024Updated 2 years ago
- TaCo: Enhancing Cross-Lingual Transfer for Low-Resource Languages in LLMs through Translation-Assisted Chain-of-Thought Processes☆14Jul 1, 2025Updated 8 months ago
- Code for "ECoFLaP: Efficient Coarse-to-Fine Layer-Wise Pruning for Vision-Language Models" (ICLR 2024)☆20Feb 16, 2024Updated 2 years ago
- ☆19Mar 5, 2024Updated 2 years ago
- EgoToM is an egocentric theory-of-mind benchmark built on Ego4D videos, containing multi-choice questions that evaluate multimodal large …☆13Apr 1, 2025Updated 11 months ago