cicl-stanford / moca
Language model evaluation for morality and causality
β16Updated last year
Alternatives and similar repositories for moca:
Users that are interested in moca are comparing it to the libraries listed below
- Apps built using Inspired Cognition's Critique.β58Updated last year
- π» Code and benchmark for our EMNLP 2023 paper - "FANToM: A Benchmark for Stress-testing Machine Theory of Mind in Interactions"β52Updated 7 months ago
- Code accompanying our papers on the "Generative Distributional Control" frameworkβ117Updated 2 years ago
- β22Updated 10 months ago
- datasets from the paper "Towards Understanding Sycophancy in Language Models"β66Updated last year
- β45Updated last year
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (httpsβ¦β43Updated 5 months ago
- β39Updated last year
- Code repository for the paper "Mission: Impossible Language Models."β41Updated this week
- Data and code for the paper "Inducing Positive Perspectives with Text Reframing"β54Updated last year
- Inspecting and Editing Knowledge Representations in Language Modelsβ111Updated last year
- β14Updated 9 months ago
- A Toolkit for Distributional Control of Generative Modelsβ70Updated last year
- This repository accompanies our paper βDo Prompt-Based Models Really Understand the Meaning of Their Prompts?ββ85Updated 2 years ago
- β26Updated last month
- IntructIR, a novel benchmark specifically designed to evaluate the instruction following ability in information retrieval models. Our focβ¦β31Updated 7 months ago
- β100Updated 8 months ago
- Experiments on including metadata such as URLs, timestamps, website descriptions and HTML tags during pretraining.