mohsenfayyaz / DecompXLinks

DecompX: Explaining Transformers Decisions by Propagating Token Decomposition [ACL 2023]

☆18

Alternatives and similar repositories for DecompX

Users that are interested in DecompX are comparing it to the libraries listed below

Sorting:

mt-upc / transformer-contributions
Measuring the Mixing of Contextual Information in the Transformer
☆31Updated 2 years ago
tylerachang / multilingual-geometry
The geometry of multilingual language model representations (EMNLP 2022).
☆21Updated 2 years ago
McGill-NLP / bias-bench
ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.
☆139Updated 7 months ago
gorokoba560 / norm-analysis-of-transformer
☆86Updated last year
Betswish / MIRAGE
Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/
☆24Updated 4 months ago
interpretingdl / eacl2024_transformer_interpretability_tutorial
Materials for EACL2024 tutorial: Transformer-specific Interpretability
☆59Updated last year
Betswish / Cross-Lingual-Consistency
Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…
☆25Updated 4 months ago
alisawuffles / DExperts
code associated with ACL 2021 DExperts paper
☆115Updated 2 years ago
TideDancer / iclr21_isotropy_contxt
☆30Updated 4 years ago
timoschick / self-debiasing
This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".
☆88Updated 3 years ago
ryokamoi / wice
This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.
☆41Updated last year
MadryLab / AT2
Attribute statements generated by LLMs to preceding tokens using attention weights.
☆15Updated 2 months ago
rudinger / winogender-schemas
Data for evaluating gender bias in coreference resolution systems.
☆79Updated 6 years ago
nyu-mll / crows-pairs
This repository contains the data and code introduced in the paper "CrowS-Pairs: A Challenge Dataset for Measuring Social Biases in Maske…
☆122Updated last year
nyu-mll / BBQ
Repository for the Bias Benchmark for QA dataset.
☆123Updated last year
shauli-ravfogel / nullspace_projection
☆89Updated 3 years ago
amazon-science / bold
Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper
☆79Updated 4 years ago
mega002 / ff-layers
The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…
☆94Updated 3 years ago
yasumasaonoe / entity_knowledge_propagation
☆17Updated last year
mohsenfayyaz / GlobEnc
[NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers
☆21Updated 2 years ago
kernelmachine / demix-data
Benchmark API for Multidomain Language Modeling
☆25Updated 2 years ago
neulab / knn-transformers
PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an…
☆275Updated 2 years ago
BatsResearch / crosslingual-test-time-scaling
Crosslingual Reasoning through Test-Time Scaling
☆18Updated 2 months ago
velocityCavalry / CREPE
An original implementation of the paper "CREPE: Open-Domain Question Answering with False Presuppositions"
☆16Updated 8 months ago
personads / depprobe
Probing for Labeled Dependency Trees (ACL 2022) + Sorting LMs by Structure (NAACL 2022)
☆8Updated last year
yanaiela / pararel
☆45Updated last year
tingofurro / summac
Codebase, data and models for the SummaC paper in TACL
☆97Updated 5 months ago
SeaEval / SeaEval
NAACL 2024: SeaEval for Multilingual Foundation Models: From Cross-Lingual Alignment to Cultural Reasoning
☆25Updated 4 months ago
kojima-takeshi188 / lang_neuron
☆18Updated last year
SALT-NLP / implicit-hate
☆40Updated 2 years ago