jennhu / metalinguistic-prompting
Materials for "Prompting is not a substitute for probability measurements in large language models" (EMNLP 2023)
☆21 · Updated last year
Alternatives and similar repositories for metalinguistic-prompting:
Users who are interested in metalinguistic-prompting are comparing it to the repositories listed below.
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language ☆42 · Updated 2 years ago
- The evaluation pipeline for the 2024 BabyLM Challenge. ☆29 · Updated 4 months ago
- ☆24 · Updated 3 months ago
- ☆34 · Updated 3 years ago
- Code repository for the paper "Mission: Impossible Language Models." ☆48 · Updated this week
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks ☆41 · Updated 3 months ago
- 🌾 Universal, customizable and deployable fine-grained evaluation for text generation. ☆22 · Updated last year
- The geometry of multilingual language model representations (EMNLP 2022). ☆19 · Updated 2 years ago
- Exploring the Limitations of Large Language Models on Multi-Hop Queries ☆24 · Updated 2 weeks ago
- Inspecting and Editing Knowledge Representations in Language Models ☆112 · Updated last year
- ☆31 · Updated 8 months ago
- ☆22 · Updated last year
- A curated list of research papers and resources on Cultural LLM. ☆40 · Updated 5 months ago
- ☆58 · Updated 2 years ago
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023. ☆41 · Updated last year
- ☆38 · Updated 10 months ago
- Evaluation pipeline for the BabyLM Challenge 2023. ☆75 · Updated last year
- Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge" ☆75 · Updated last year
- Apps built using Inspired Cognition's Critique. ☆58 · Updated 2 years ago
- The LM Contamination Index is a manually created database of contamination evidences for LMs. ☆77 · Updated 11 months ago
- ☆104 · Updated 10 months ago
- Official repository for our EACL 2023 paper "LongEval: Guidelines for Human Evaluation of Faithfulness in Long-form Summarization" (https… ☆43 · Updated 7 months ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales" ☆15 · Updated last year
- Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑 ☆14 · Updated 10 months ago
- ☆44 · Updated last year
- A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models. ☆100 · Updated last year
- Supporting code for ReCEval paper ☆28 · Updated 6 months ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/ ☆21 · Updated this week
- ☆65 · Updated last year
- Code for the arXiv paper: "LLMs as Factual Reasoners: Insights from Existing Benchmarks and Beyond" ☆59 · Updated last month