mitmedialab / nmi-ai-2023
A repository for the paper "Beliefs about AI influence human-AI interaction and can be manipulated to increase perceived trustworthiness, empathy, and effectiveness" Nature Machine Intelligence 2023.
☆15Updated last year
Alternatives and similar repositories for nmi-ai-2023:
Users that are interested in nmi-ai-2023 are comparing it to the libraries listed below
- Repo for: When to Make Exceptions: Exploring Language Models as Accounts of Human Moral Judgment☆38Updated last year
- ☆21Updated 9 months ago
- ☆90Updated 8 months ago
- Evaluating the Moral Beliefs Encoded in LLMs☆23Updated 2 months ago
- ☆35Updated 2 years ago
- ☆12Updated 2 years ago
- ☆104Updated 9 months ago
- Tasks for describing differences between text distributions.☆16Updated 6 months ago
- A collection of works that investigate social agents, simulations and their real-world impact in text, embodied, and robotics contexts.☆79Updated 8 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"☆71Updated last year
- DialOp: Decision-oriented dialogue environments for collaborative language agents☆106Updated 3 months ago
- ☆36Updated last year
- PATIENT-Ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals (EMNLP 2024)☆58Updated 3 months ago
- Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique☆12Updated 5 months ago
- This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…☆59Updated last year
- Data and code for the paper "NormBank: A Knowledge Bank of Situational Social Norms"☆24Updated last year
- ☆16Updated 11 months ago
- Governance of the Commons Simulation (GovSim)☆36Updated last month
- Sotopia-π: Interactive Learning of Socially Intelligent Language Agents (ACL 2024)☆57Updated 9 months ago
- The Prism Alignment Project☆66Updated 9 months ago
- 👻 Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"☆52Updated 8 months ago
- ☆31Updated last year
- Alignment with a millennium of moral progress. Spotlight@NeurIPS 2024 Track on Datasets and Benchmarks.☆20Updated last month
- Code for our NeurIPS'24 Dataset and Benchmark paper: Cooperation, Competition, and Maliciousness: LLM-Stakeholders Interactive Negotiatio…☆25Updated 3 months ago
- Neuron Activation☆23Updated 3 months ago
- Code/data for MARG (multi-agent review generation)☆38Updated 3 months ago
- code and data associated with CoMPosT: Characterizing and Evaluating Caricature in LLM Simulations☆10Updated last year
- ☆40Updated last year
- Implementation of PaCE: Parsimonious Concept Engineering for Large Language Models (NeurIPS 2024)☆32Updated 3 months ago
- ✨ Resolving Knowledge Conflicts in Large Language Models, COLM 2024☆15Updated 4 months ago