jennhu / metalinguistic-prompting
Materials for "Prompting is not a substitute for probability measurements in large language models" (EMNLP 2023)
☆23Updated last year
Alternatives and similar repositories for metalinguistic-prompting:
Users that are interested in metalinguistic-prompting are comparing it to the libraries listed below
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆22Updated last month
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆41Updated last year
- The geometry of multilingual language model representations (EMNLP 2022).☆20Updated 2 years ago
- ☆16Updated 3 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language☆42Updated 2 years ago
- Teaching Models to Express Their Uncertainty in Words☆38Updated 2 years ago
- ☆34Updated 3 years ago
- ☆34Updated 10 months ago
- ☆44Updated last year
- ☆48Updated 2 years ago
- Code repository for the paper "Mission: Impossible Language Models."☆52Updated last week
- The evaluation pipeline for the 2024 BabyLM Challenge.☆30Updated 5 months ago
- ☆24Updated 2 years ago
- ☆26Updated 4 months ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆41Updated 4 months ago
- ☆82Updated 2 years ago
- Inspecting and Editing Knowledge Representations in Language Models☆115Updated last year
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑☆14Updated last year
- Code and data associated with the AmbiEnt dataset in "We're Afraid Language Models Aren't Modeling Ambiguity" (Liu et al., 2023)☆61Updated last year
- The LM Contamination Index is a manually created database of contamination evidences for LMs.☆78Updated last year
- ☆58Updated 2 years ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https…☆43Updated 8 months ago
- This code accompanies the paper DisentQA: Disentangling Parametric and Contextual Knowledge with Counterfactual Question Answering.☆17Updated 2 years ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆90Updated 3 years ago
- ☆46Updated last year
- Supporting code for ReCEval paper☆28Updated 7 months ago
- Code for "Tracing Knowledge in Language Models Back to the Training Data"☆37Updated 2 years ago
- Follow the Wisdom of the Crowd: Effective Text Generation via Minimum Bayes Risk Decoding☆18Updated 2 years ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation.☆22Updated last year
- ☆22Updated last year