mt-upc / transformer-contributionsLinks

Measuring the Mixing of Contextual Information in the Transformer

☆31

Alternatives and similar repositories for transformer-contributions

Users that are interested in transformer-contributions are comparing it to the libraries listed below

Sorting:

tylerachang / multilingual-geometry
The geometry of multilingual language model representations (EMNLP 2022).
☆21Updated 2 years ago
mt-upc / transformer-contributions-nmt
☆17Updated 2 years ago
gsarti / pecore
Materials for "Quantifying the Plausibility of Context Reliance in Neural Machine Translation" at ICLR'24 🐑 🐑
☆15Updated last year
gorokoba560 / norm-analysis-of-transformer
☆86Updated last year
deep-spin / hallucinations-in-nmt
☆20Updated last year
lukemelas / mtob
☆37Updated last year
kernelmachine / demix-data
Benchmark API for Multidomain Language Modeling
☆25Updated 2 years ago
ekinakyurek / influence
Code for "Tracing Knowledge in Language Models Back to the Training Data"
☆38Updated 2 years ago
GEM-benchmark / GEM-metrics
Automatic metrics for GEM tasks
☆66Updated 2 years ago
kernelmachine / demix
DEMix Layers for Modular Language Modeling
☆53Updated 3 years ago
jzbjyb / lm-calibration
☆35Updated 3 years ago
aviclu / ffn-values
☆62Updated 2 years ago
SeaEval / SeaEval
NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning
☆25Updated 5 months ago
mprompting / xlmrprompt
☆11Updated 3 years ago
hsajjad / Interpretability-Tutorial-NAACL2021
☆24Updated 4 years ago
mohsenfayyaz / DecompX
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]
☆18Updated last month
krishnap25 / mauve-experiments
☆38Updated last year
mohsenfayyaz / GlobEnc
[NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
☆21Updated 2 years ago
john-hewitt / truncation-sampling
Codebase describing experiments in Truncation Sampling as Language Model Desmoothing
☆12Updated 2 years ago
alisawuffles / DExperts
code associated with ACL 2021 DExperts paper
☆115Updated 2 years ago
john-hewitt / backpacks-flash-attn
The original Backpack Language Model implementation, a fork of FlashAttention
☆69Updated 2 years ago
shadowkiller33 / ParaScore
☆29Updated 2 years ago
machelreid / m2d2
M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer
☆54Updated 2 years ago
cambridgeltl / composable-sft
A library for parameter-efficient and composable transfer learning for NLP with sparse fine-tunings.
☆74Updated 11 months ago
faridlazuarda / cultural-llm-papers
A curated list of research papers and resources on Cultural LLM.
☆46Updated 10 months ago
ZurichNLP / mbr
Minimum Bayes Risk Decoding for Hugging Face Transformers
☆58Updated last year
kojima-takeshi188 / lang_neuron
☆18Updated last year
qkaren / COLD_decoding
☆108Updated 3 years ago
awebson / prompt_semantics
This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”
☆85Updated 3 years ago
peterwestuw / surface-form-competition
☆58Updated 3 years ago