nttmdlab-nlp / ToMATOLinks
ToMATO: Verbalizing the Mental States of Role-Playing LLMs for Benchmarking Theory of Mind (AAAI2025)
☆19Updated 9 months ago
Alternatives and similar repositories for ToMATO
Users that are interested in ToMATO are comparing it to the libraries listed below
Sorting:
- ☆59Updated last year
- ☆13Updated last year
- The Unreliability of Explanations in Few-shot Prompting for Textual Reasoning (NeurIPS 2022)☆16Updated 2 years ago
- ☆56Updated last year
- Code repository for the paper "Mission: Impossible Language Models."☆56Updated 4 months ago
- [ICML 2024] Fool Your (Vision and) Language Model With Embarrassingly Simple Permutations☆15Updated 2 years ago
- ControlLM is a method to control the personality traits and behaviors of language models in real-time at inference without costly trainin…☆19Updated last year
- ☆37Updated 2 years ago
- AbstainQA, ACL 2024☆28Updated this week
- ☆20Updated last year
- This repository includes the implementation and results of the paper "ChatGPT is fun, but it is not funny! Humor is still challenging Lar…☆13Updated 2 years ago
- GSM-Plus: Data, Code, and Evaluation for Enhancing Robust Mathematical Reasoning in Math Word Problems.☆64Updated last year
- [COLM 2025] EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees☆31Updated 6 months ago
- List of papers on Self-Correction of LLMs.☆80Updated last year
- Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lo…☆16Updated last year
- Evaluate the Quality of Critique☆36Updated last year
- ☆27Updated last year
- [ICML 2024] One Prompt is Not Enough: Automated Construction of a Mixture-of-Expert Prompts - TurningPoint AI☆31Updated last year
- Investigating Cultural Alignment of Large Language Models☆13Updated last year
- ☆53Updated 10 months ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries☆30Updated 11 months ago
- PASTA: Post-hoc Attention Steering for LLMs☆134Updated last year
- Code associated with Tuning Language Models by Proxy (Liu et al., 2024)☆127Updated last year
- ☆57Updated 8 months ago
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆39Updated last year
- code for Preprint paper at Arxiv: MoT: Pre-thinking and Recalling Enable ChatGPT to Self-Improve with Memory-of-Thoughts☆24Updated 2 years ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆81Updated last year
- [AAAI 2025 oral] Evaluating Mathematical Reasoning Beyond Accuracy☆77Updated 4 months ago
- Source code for Truth-Aware Context Selection: Mitigating the Hallucinations of Large Language Models Being Misled by Untruthful Contexts☆17Updated last year
- [ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"☆109Updated 2 years ago