technion-cs-nlp / Individual-Neurons-Pitfalls

☆10

Alternatives and similar repositories for Individual-Neurons-Pitfalls

Users who are interested in Individual-Neurons-Pitfalls are comparing it to the repositories listed below.

yanaiela / pararel
☆45 Updated last year
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆38 Updated 2 years ago
ruiqi-zhong / DescribeDistributionalDifferences
Code for preprint: Summarizing Differences between Text Distributions with Natural Language
☆42 Updated 2 years ago
keyonvafa / sequential-rationales
Rationales for Sequential Predictions
☆40 Updated 3 years ago
peterbhase / LAS-NL-Explanations
Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?"
☆22 Updated 4 years ago
jacobandreas / geca
☆42 Updated 4 years ago
allenai / contrastive-explanations
Explaining neural decisions contrastively to alternative decisions.
☆25 Updated 4 years ago
princeton-nlp / LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
☆76 Updated last year
belindal / state-probes
Code for the paper "Implicit Representations of Meaning in Neural Language Models"
☆54 Updated 2 years ago
hsajjad / Interpretability-Tutorial-NAACL2021
☆24 Updated 4 years ago
aaronmueller / MIB
Landing page for MIB: A Mechanistic Interpretability Benchmark
☆16 Updated last week
awebson / prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
☆85 Updated 3 years ago
swarnaHub / ExplaGraphs
[EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning
☆12 Updated 2 years ago
violet-zct / swarm-distillation-zero-shot
☆22 Updated 2 years ago
nyu-mll / SQuALITY
Query-focused summarization data
☆42 Updated 2 years ago
tuvuumass / task-transferability
Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020.
☆50 Updated 4 years ago
frankxu2004 / knnlm-why
Repo for ICML23 "Why do Nearest Neighbor Language Models Work?"
☆58 Updated 2 years ago
BrachioLab / incontext_influences
In-context Example Selection with Influences
☆15 Updated 2 years ago
CoderPat / learning-scaffold
This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"
☆19 Updated 3 years ago
krishnap25 / mauve-experiments
☆38 Updated last year
neulab / retomaton
PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022)
☆73 Updated 3 years ago
technion-cs-nlp / bias-probing
Debiasing Methods in Natural Language Understanding Make Bias More Accessible: Code and Data
☆14 Updated 3 years ago
McGill-NLP / polytropon
☆54 Updated 2 years ago
allenai / label_rationale_association
Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales"
☆12 Updated last year
leo-liuzy / probe-across-time
☆22 Updated 3 years ago
allenai / few_shot_explanations
Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations"
☆31 Updated 2 years ago
peterwestuw / surface-form-competition
☆58 Updated 3 years ago
salesforce / fast-influence-functions
☆89 Updated 2 months ago
GEM-benchmark / GEM-metrics
Automatic metrics for GEM tasks
☆66 Updated 2 years ago
kernelmachine / demix
DEMix Layers for Modular Language Modeling
☆53 Updated 3 years ago