mega002 / ff-layersLinks

The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Levy. EMNLP, 2021.

☆94

Alternatives and similar repositories for ff-layers

Users that are interested in ff-layers are comparing it to the libraries listed below

Sorting:

princeton-nlp / LM-Kernel-FT
A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643
☆78Updated last year
aviclu / ffn-values
☆62Updated 2 years ago
nkandpa2 / long_tail_knowledge
Repo for the paper "Large Language Models Struggle to Learn Long-Tail Knowledge"
☆77Updated 2 years ago
google / belief-localization
This repository includes code for the paper "Does Localization Inform Editing? Surprising Differences in Where Knowledge Is Stored vs. Ca…
☆61Updated 2 years ago
qkaren / COLD_decoding
☆108Updated 3 years ago
GXimingLu / Quark
☆75Updated last year
evandez / REMEDI
Inspecting and Editing Knowledge Representations in Language Models
☆116Updated 2 years ago
swj0419 / in-context-pretraining
☆53Updated last year
roeehendel / icl_task_vectors
☆96Updated last year
HazyResearch / skill-it
Skill-It! A Data-Driven Skills Framework for Understanding and Training Language Models
☆46Updated last year
yanaiela / pararel
☆45Updated last year
tau-nlp / scrolls
The official code of EMNLP 2022, "SCROLLS: Standardized CompaRison Over Long Language Sequences".
☆70Updated last year
alestolfo / lm-arithmetic
Code for the paper "A Mechanistic Interpretation of Arithmetic Reasoning in Language Models using Causal Mediation Analysis"
☆18Updated last month
nayeon7lee / FactualityPrompt
☆87Updated 2 years ago
Alrope123 / rethinking-demonstrations
☆178Updated last year
balevinstein / Probes
☆52Updated 2 years ago
HKUNLP / icl-ceil
[ICML 2023] Code for our paper “Compositional Exemplars for In-context Learning”.
☆102Updated 2 years ago
jaehunjung1 / Maieutic-Prompting
☆50Updated last year
saprmarks / geometry-of-truth
☆87Updated 11 months ago
YuxiXie / SelfEval-Guided-Decoding
☆99Updated last year
nicola-decao / KnowledgeEditor
Code for Editing Factual Knowledge in Language Models
☆139Updated 3 years ago
edenbiran / HoppingTooLate
Exploring the Limitations of Large Language Models on Multi-Hop Queries
☆27Updated 5 months ago
google-research-datasets / GSM-IC
Grade-School Math with Irrelevant Context (GSM-IC) benchmark is an arithmetic reasoning dataset built upon GSM8K, by adding irrelevant se…
☆60Updated 2 years ago
xlang-ai / icl-selective-annotation
[ICLR 2023] Code for our paper "Selective Annotation Makes Language Models Better Few-Shot Learners"
☆108Updated 2 years ago
epfl-dlab / llm-latent-language
Repo accompanying our paper "Do Llamas Work in English? On the Latent Language of Multilingual Transformers".
☆78Updated last year
ruiqi-zhong / nlparam
Augmenting Statistical Models with Natural Language Parameters
☆27Updated 10 months ago
Thartvigsen / GRACE
[NeurIPS'23] Aging with GRACE: Lifelong Model Editing with Discrete Key-Value Adaptors
☆77Updated 7 months ago
hkust-nlp / felm
Github repository for "FELM: Benchmarking Factuality Evaluation of Large Language Models" (NeurIPS 2023)
☆59Updated last year
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆38Updated 2 years ago
eric-mitchell / mend
MEND: Fast Model Editing at Scale
☆249Updated last year