mitmedialab / nmi-ai-2023Links
A repository for the paper "Beliefs about AI influence human-AI interaction and can be manipulated to increase perceived trustworthiness, empathy, and effectiveness" Nature Machine Intelligence 2023.
☆17Updated last year
Alternatives and similar repositories for nmi-ai-2023
Users that are interested in nmi-ai-2023 are comparing it to the libraries listed below
Sorting:
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated 2 years ago
- ☆23Updated last year
- ☆95Updated last year
- code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations☆10Updated last year
- ☆12Updated 2 years ago
- Tasks for describing differences between text distributions.☆16Updated 10 months ago
- Evaluating the Moral Beliefs Encoded in LLMs☆26Updated 6 months ago
- ☆106Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 7 months ago
- Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique☆17Updated 10 months ago
- This repo contains code for our NeurIPS 2023 spotlight paper: Evaluating and Inducing Personality in Pre-trained Language Models☆52Updated last year
- This repository contains the official code for the paper: "Prompt Injection: Parameterization of Fixed Inputs"☆32Updated 9 months ago
- ☆36Updated 2 years ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆61Updated 2 years ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆55Updated last year
- ☆78Updated 2 years ago
- This repository contains data, code and models for contextual noncompliance.☆23Updated 11 months ago
- Source code and data for ADEPT: A DEbiasing PrompT Framework (AAAI-23).☆15Updated 6 months ago
- ☆35Updated last month
- ☆35Updated 2 years ago
- Data Valuation on In-Context Examples (ACL23)☆23Updated 5 months ago
- Codes and Datasets for our ACL 2023 paper on cognitive reframing of negative thoughts☆63Updated last year
- Code and Data for the NAACL 24 paper: MacGyver: Are Large Language Models Creative Problem Solvers?☆28Updated last year
- ☆16Updated 9 months ago
- Restore safety in fine-tuned language models through task arithmetic☆28Updated last year
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.☆89Updated last year
- Byte-sized text games for code generation tasks on virtual environments☆19Updated 11 months ago
- Recent papers on (1) Psychology of LLMs; (2) Biases in LLMs.☆49Updated last year
- ☆28Updated last year
- PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆37Updated 7 months ago