mohsenfayyaz / DecompX
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition
☆15Updated last year
Related projects ⓘ
Alternatives and complementary repositories for DecompX
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers☆21Updated last year
- Measuring the Mixing of Contextual Information in the Transformer☆25Updated last year
- The official implementation of the ACL 2023 paper, "Paraphrasing-Guided Data Augmentation for Contrastive Prompt-based Few-shot Fine-tuni…☆10Updated 11 months ago
- Repository for the Bias Benchmark for QA dataset.☆85Updated 10 months ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆124Updated last year
- ☆37Updated last year
- ☆109Updated last year
- The geometry of multilingual language model representations (EMNLP 2022).☆15Updated 2 years ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆21Updated this week
- Materials for EACL2024 tutorial: Transformer-specific Interpretability☆42Updated 7 months ago
- ☆28Updated 3 years ago
- ☆16Updated last year
- ☆26Updated 6 months ago
- Code for ACL 2023 paper "BOLT: Fast Energy-based Controlled Text Generation with Tunable Biases".☆18Updated last year
- ☆16Updated 2 years ago
- ☆23Updated 2 years ago
- Dataset associated with "BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language Generation" paper☆65Updated 3 years ago
- Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆21Updated 3 weeks ago
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings)☆19Updated 3 years ago
- Code and data for Marked Personas (ACL 2023)☆21Updated last year
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP".☆86Updated 3 years ago
- Probing for Labeled Dependency Trees (ACL 2022) + Sorting LMs by Structure (NAACL 2022)☆8Updated 5 months ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆85Updated 3 years ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆12Updated 2 years ago
- ☆13Updated 2 years ago
- [ICML 2021] Towards Understanding and Mitigating Social Biases in Language Models☆60Updated 2 years ago
- ☆77Updated 6 months ago
- ☆58Updated last year
- ☆32Updated last year
- Dataset, metrics, and models for TACL 2023 paper MACSUM: Controllable Summarization with Mixed Attributes.☆34Updated last year