shawnsihyunlee / simulatedtom
Public repository for "Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities".
☆13 Updated last year
Related projects
Alternatives and complementary repositories for simulatedtom
- ☆26 Updated last year
- ☆11 Updated 6 months ago
- Official Implementation for the paper "Integrative Decoding: Improving Factuality via Implicit Self-consistency" ☆17 Updated last month
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts" ☆59 Updated 7 months ago
- ToMBench: Benchmarking Theory of Mind in Large Language Models, ACL 2024. ☆31 Updated 4 months ago
- Benchmarking LLMs' Psychological Portrayal ☆66 Updated 3 months ago
- Official repository for the paper "High-Dimension Human Value Representation in Large Language Models" ☆20 Updated 4 months ago
- EMNLP 2022: "MABEL: Attenuating Gender Bias using Textual Entailment Data" https://arxiv.org/abs/2210.14975 ☆37 Updated 11 months ago
- Code and data for paper "Context-faithful Prompting for Large Language Models". ☆39 Updated last year
- Contrastive Chain-of-Thought Prompting ☆53 Updated 11 months ago
- Data and code for ACL 2022 paper "MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual Data" ☆41 Updated 3 weeks ago
- Personality Alignment of Language Models ☆18 Updated 2 months ago
- Code for "Mitigating Unhelpfulness in Emotional Support Conversations with Multifaceted AI Feedback" (ACL 2024 Findings) ☆12 Updated 4 months ago
- GitHub repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023) ☆54 Updated 10 months ago
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator" ☆48 Updated 8 months ago
- ☆46 Updated 10 months ago
- ☆25 Updated 7 months ago
- ☆82 Updated last year
- ☆15 Updated 9 months ago
- This is the official repository for the paper "EmoBench: Evaluating the Emotional Intelligence of Large Language Models" ☆47 Updated 8 months ago
- Paper collection of methods that use language to interact with the environment, including interacting with the real world, simulated world or WWW… ☆122 Updated last year
- Supporting code for ReCEval paper ☆26 Updated last month
- Repository for the Bias Benchmark for QA dataset. ☆85 Updated 10 months ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”. ☆92 Updated last year
- ☆16 Updated last year
- Machine Theory of Mind Reading List. Built upon EMNLP Findings 2023 Paper: Towards A Holistic Landscape of Situated Theory of Mind in Lar… ☆101 Updated 9 months ago
- Code for the paper "Self-Detoxifying Language Models via Toxification Reversal" (EMNLP 2023) ☆15 Updated last year
- GPT as Human ☆18 Updated 9 months ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering. ☆18 Updated last year
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se… ☆55 Updated last year