mitmedialab / nmi-ai-2023
A repository for the paper "Beliefs about AI influence human-AI interaction and can be manipulated to increase perceived trustworthiness, empathy, and effectiveness" Nature Machine Intelligence 2023.
☆16 Updated last year
Alternatives and similar repositories for nmi-ai-2023:
Users who are interested in nmi-ai-2023 are comparing it to the repositories listed below.
- ☆93 Updated 11 months ago
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment ☆38 Updated last year
- ☆23 Updated last year
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions" ☆55 Updated 11 months ago
- code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations ☆10 Updated last year
- ☆36 Updated 2 years ago
- ☆12 Updated 2 years ago
- DialOp: Decision-oriented dialogue environments for collaborative language agents ☆106 Updated 5 months ago
- ☆106 Updated last year
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca… ☆61 Updated last year
- Tasks for describing differences between text distributions. ☆16 Updated 8 months ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models ☆52 Updated last year
- Data and code for the paper "Inducing Positive Perspectives with Text Reframing" ☆60 Updated last year
- datasets from the paper "Towards Understanding Sycophancy in Language Models" ☆75 Updated last year
- This repo contains code for the paper "Psychologically-informed chain-of-thought prompts for metaphor understanding in large language mod… ☆14 Updated 2 years ago
- Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts ☆60 Updated last year
- ☆16 Updated 8 months ago
- This repository contains data, code and models for contextual noncompliance. ☆22 Updated 9 months ago
- The KiloGram Tangrams dataset ☆55 Updated last week
- Stanford NLP Python library for benchmarking the utility of LLM interpretability methods ☆76 Updated last month
- ☆34 Updated last year
- ☆78 Updated 2 years ago
- Reference implementation for Reward-Augmented Decoding: Efficient Controlled Text Generation With a Unidirectional Reward Model ☆44 Updated last year
- Inspecting and Editing Knowledge Representations in Language Models ☆116 Updated last year
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs" ☆32 Updated 7 months ago
- ☆68 Updated last year
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024) ☆64 Updated last year
- Data and code for the paper "NormBank: A Knowledge Bank of Situational Social Norms" ☆27 Updated last year
- Evaluating the Moral Beliefs Encoded in LLMs ☆26 Updated 4 months ago
- Official Repo for the Paper "AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution o… ☆12 Updated 3 months ago