SCAI-JHU / AutoToMLinks
AutoToM: Scaling Model-based Mental Inference via Automated Agent Modeling
☆27Updated 2 months ago
Alternatives and similar repositories for AutoToM
Users that are interested in AutoToM are comparing it to the libraries listed below
Sorting:
- ☆97Updated 6 months ago
- Benchmarking LLMs' Psychological Portrayal☆123Updated 9 months ago
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar…☆144Updated 7 months ago
- Benchmarking LLMs' Emotional Alignment with Humans☆111Updated 7 months ago
- [ICLR'25] ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery☆103Updated last month
- PPTC Benchmark: Evaluating Large Language Models for PowerPoint Task Completion☆57Updated last year
- official implementation of paper "Process Reward Model with Q-value Rankings"☆63Updated 7 months ago
- ☆39Updated last year
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆61Updated last year
- [ICML 2025] Satori: Reinforcement Learning with Chain-of-Action-Thought Enhances LLM Reasoning via Autoregressive Search☆108Updated 4 months ago
- [ICML 2025] Flow of Reasoning: Training LLMs for Divergent Reasoning with Minimal Examples☆106Updated 2 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆77Updated last year
- Interpretable Contrastive Monte Carlo Tree Search Reasoning☆48Updated 10 months ago
- ☆53Updated 7 months ago
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.☆96Updated last year
- [NeurIPS 2024 Oral] Aligner: Efficient Alignment by Learning to Correct☆186Updated 8 months ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆49Updated last year
- This the implementation of LeCo☆31Updated 8 months ago
- A trainable user simulator☆34Updated 3 months ago
- [ACL'24] Chain of Thought (CoT) is significant in improving the reasoning abilities of large language models (LLMs). However, the correla…☆45Updated 4 months ago
- [ACL 2023] Solving Math Word Problems via Cooperative Reasoning induced Language Models (LLMs + MCTS + Self-Improvement)☆50Updated last year
- [ICML 2025] Teaching Language Models to Critique via Reinforcement Learning☆111Updated 4 months ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆55Updated last year
- B-STAR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners☆85Updated 4 months ago
- Benchmarking LLMs' Gaming Ability in Multi-Agent Environments☆88Updated 5 months ago
- [ACL24] EmoBench: Evaluating the Emotional Intelligence of Large Language Models☆92Updated 4 months ago
- PreAct: Prediction Enhances Agent's Planning Ability (Coling2025)☆29Updated 9 months ago
- ☆25Updated last year
- Code for ICLR 2024 paper "CRAFT: Customizing LLMs by Creating and Retrieving from Specialized Toolsets"☆59Updated last year
- Unofficial Implementation of Chain-of-Thought Reasoning Without Prompting☆33Updated last year