shawnsihyunlee / simulatedtom
Public repository for "Think Twice: Perspective-Taking Improves Large Language Models’ Theory-of-Mind Capabilities".
☆17Updated last year
Alternatives and similar repositories for simulatedtom:
Users that are interested in simulatedtom are comparing it to the libraries listed below
- ☆51Updated last year
- Public code repo for paper "Aligning LLMs with Individual Preferences via Interaction"☆24Updated 5 months ago
- ☆41Updated last year
- [ACL'24] Code and data of paper "When is Tree Search Useful for LLM Planning? It Depends on the Discriminator"☆54Updated last year
- The official repository of "Improving Large Language Models via Fine-grained Reinforcement Learning with Minimum Editing Constraint"☆37Updated last year
- This is the official repo for Towards Uncertainty-Aware Language Agent.☆24Updated 7 months ago
- [NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆52Updated 4 months ago
- AbstainQA, ACL 2024☆25Updated 5 months ago
- ☆29Updated last year
- The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism☆28Updated 8 months ago
- ☆20Updated 10 months ago
- [ICLR'24 Spotlight] "Adaptive Chameleon or Stubborn Sloth: Revealing the Behavior of Large Language Models in Knowledge Conflicts"☆67Updated 11 months ago
- Generating diverse counterfactual data for Natural Language Understanding tasks using Large Language Models (LLMs). The generator support…☆36Updated last year
- Inspecting and Editing Knowledge Representations in Language Models☆114Updated last year
- Semi-Parametric Editing with a Retrieval-Augmented Counterfactual Model☆66Updated 2 years ago
- Personality Alignment of Language Models☆27Updated 3 weeks ago
- Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…☆58Updated 2 years ago
- Visual and Embodied Concepts evaluation benchmark☆21Updated last year
- ☆22Updated 6 months ago
- In-Context Sharpness as Alerts: An Inner Representation Perspective for Hallucination Mitigation (ICML 2024)☆57Updated last year
- Official PyTorch Implementation of EMoE: Unlocking Emergent Modularity in Large Language Models [main conference @ NAACL2024]☆29Updated 10 months ago
- Code for RL4F: Generating Natural Language Feedback with Reinforcement Learning for Repairing Model Outputs. ACL 2023.☆62Updated 4 months ago
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆61Updated 10 months ago
- Tasks for describing differences between text distributions.☆16Updated 7 months ago
- [ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.☆98Updated 2 years ago
- Evaluate the Quality of Critique☆34Updated 10 months ago
- ☆36Updated last year
- ☆12Updated 10 months ago
- Target-oriented Proactive Dialogue Systems with Personalization: Problem Formulation and Dataset Curation (EMNLP 2023)☆30Updated 11 months ago
- Methods and evaluation for aligning language models temporally☆29Updated last year