jkallini / mission-impossible-language-models

Code repository for the paper "Mission: Impossible Language Models."

☆52

Alternatives and similar repositories for mission-impossible-language-models

Users who are interested in mission-impossible-language-models are comparing it to the repositories listed below.

epfl-dlab / llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
☆78 Updated last year
mega002 / ff-layers
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…
☆94 Updated 3 years ago
edenbiran / HoppingTooLate
Exploring the Limitations of Large Language Models on Multi-Hop Queries
☆27 Updated 5 months ago
google-research-datasets / GSM-IC
Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…
☆60 Updated 2 years ago
activatedgeek / calibration-tuning
☆51 Updated 3 months ago
Betswish / MIRAGE
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆24 Updated 4 months ago
roeehendel / icl_task_vectors
☆96 Updated last year
xlang-ai / icl-selective-annotation
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
☆108 Updated 2 years ago
aryamanarora / causalgym
CausalGym: Benchmarking causal interpretability methods on linguistic tasks
☆46 Updated 8 months ago
allenai / noncompliance
This repository contains data, code and models for contextual noncompliance.
☆23 Updated last year
dannyallover / overthinking_the_truth
☆29 Updated last year
explanare / ravel
Evaluate interpretability methods on localizing and disentangling concepts in LLMs.
☆52 Updated 10 months ago
abertsch72 / long-context-icl
Data and code for the preprint "In-Context Learning with Long-Context Models: An In-Depth Exploration"
☆38 Updated 11 months ago
aviclu / ffn-values
☆62 Updated 2 years ago
yuzhaouoe / SAE-based-representation-engineering
[NAACL'25 Oral] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering
☆62 Updated 8 months ago
nouhadziri / faith-and-fate
☆34 Updated last year
google / belief-localization
This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…
☆61 Updated 2 years ago
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆27 Updated 10 months ago
jzbjyb / lm-calibration
☆35 Updated 3 years ago
saprmarks / geometry-of-truth
☆89 Updated 11 months ago
evandez / REMEDI
Inspecting and Editing Knowledge Representations in Language Models
☆116 Updated 2 years ago
sylinrl / CalibratedMath
Teaching Models to Express Their Uncertainty in Words
☆39 Updated 3 years ago
McGill-NLP / polytropon
☆54 Updated 2 years ago
jmhessel / caption_contest_corpus
Corpus to accompany: "Do Androids Laugh at Electric Sheep? Humor "Understanding" Benchmarks from The New Yorker Caption Contest"
☆56 Updated 4 months ago
nkandpa2 / long_tail_knowledge
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆77 Updated 2 years ago
lukemelas / mtob
☆37 Updated last year
tommccoy1 / embers-of-autoregression
☆29 Updated 7 months ago
Nanami18 / Snowballed_Hallucination
☆45 Updated 11 months ago
archiki / ReCEval
Supporting code for ReCEval paper
☆29 Updated 10 months ago
ajyl / dpo_toxic
A Mechanistic Understanding of Alignment Algorithms: A Case Study on DPO and Toxicity.
☆74 Updated 4 months ago