nttmdlab-nlp / ToMATOLinks
ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)
☆13Updated last month
Alternatives and similar repositories for ToMATO
Users that are interested in ToMATO are comparing it to the libraries listed below
Sorting:
- List of papers on Self-Correction of LLMs.☆73Updated 5 months ago
- The source code for running LLMs on the AAAR-1.0 benchmark.☆16Updated 2 months ago
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆15Updated 6 months ago
- personalized-llms with allen institute☆15Updated last year
- Code/data for MARG (multi-agent review generation)☆43Updated 6 months ago
- MultiTool-CoT: GPT-3 Can Use Multiple External Tools with Chain of Thought Prompting☆21Updated last year
- Metacognitive Prompting Improves Understanding in Large Language Models (NAACL 2024)☆34Updated last year
- ☆22Updated 3 weeks ago
- ☆31Updated last year
- Leveraging Outputs of Large Language Model as Feedback for Dynamic Reranking in Retrieval-Augmented Generation☆24Updated 3 weeks ago
- [NeurIPS 2024] Train LLMs with diverse system messages reflecting individualized preferences to generalize to unseen system messages☆47Updated 6 months ago
- [ACL 2024] <Large Language Models for Automated Open-domain Scientific Hypotheses Discovery>. It has also received the best poster award …☆41Updated 7 months ago
- [EMNLP 2024 Findings] ProSA: Assessing and Understanding the Prompt Sensitivity of LLMs☆27Updated 2 weeks ago
- [ICML 2024] Language Models Represent Beliefs of Self and Others☆32Updated 8 months ago
- Discriminator-Guided Chain-of-Thought Reasoning☆47Updated 7 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆65Updated last year
- ☆16Updated 4 months ago
- This repository contains the code and data for the paper "VisOnlyQA: Large Vision Language Models Still Struggle with Visual Perception o…☆23Updated 2 months ago
- [ACL2023, Findings] Source codes for the paper "Werewolf Among Us: Multimodal Resources for Modeling Persuasion Behaviors in Social Deduc…☆13Updated 3 months ago
- ☆23Updated last year
- This is the repository for paper EscapeBench: Pushing Language Models to Think Outside the Box☆15Updated 5 months ago
- Tasks for describing differences between text distributions.☆16Updated 9 months ago
- Code and data for the ACL 2024 Findings paper "Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning"☆26Updated last year
- Mosaic IT: Enhancing Instruction Tuning with Data Mosaics☆18Updated 3 months ago
- Repository for Skill Set Optimization☆13Updated 10 months ago
- ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin…☆18Updated 7 months ago
- ☆21Updated 7 months ago
- ☆52Updated 6 months ago
- Public code repo for COLING 2025 paper "Aligning LLMs with Individual Preferences via Interaction"☆27Updated 2 months ago
- Investigating Cultural Alignment of Large Language Models☆12Updated 9 months ago