mohsenfayyaz / DecompX
DecompX: Explaining Transformers Decisions by Propagating Token Decomposition
☆15Updated last year
Alternatives and similar repositories for DecompX:
Users that are interested in DecompX are comparing it to the libraries listed below
- [NAACL 2022] GlobEnc: Quantifying Global Token Attribution by Incorporating the Whole Encoder Layer in Transformers☆21Updated last year
- The geometry of multilingual language model representations (EMNLP 2022).☆16Updated 2 years ago
- Measuring the Mixing of Contextual Information in the Transformer☆27Updated last year
- The official implementation of the ACL 2023 paper, "Paraphrasing-Guided Data Augmentation for Contrastive Prompt-based Few-shot Fine-tuni…☆10Updated last year
- ☆46Updated last year
- Materials for EACL2024 tutorial: Transformer-specific Interpretability☆44Updated 10 months ago
- ☆30Updated 3 years ago
- Easy-to-use framework for evaluating cross-lingual consistency of factual knowledge (Supported LLaMA, BLOOM, mT5, RoBERTa, etc.) Paper he…☆22Updated 2 months ago
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages☆23Updated last month
- ☆16Updated last year
- LoFiT: Localized Fine-tuning on LLM Representations☆30Updated 2 weeks ago
- ☆23Updated last month
- This repository contains the dataset and code for "WiCE: Real-World Entailment for Claims in Wikipedia" in EMNLP 2023.☆40Updated last year
- ☆13Updated 7 months ago
- UnQovering Stereotyping Biases via Underspecified Questions - EMNLP 2020 (Findings)☆20Updated 3 years ago
- ☆24Updated 2 years ago
- Repo for paper: Examining LLMs' Uncertainty Expression Towards Questions Outside Parametric Knowledge☆12Updated 11 months ago
- A Large-Scale Gender Bias Dataset for Coreference Resolution and Machine Translation, Levy et al., Findings of EMNLP 2021☆13Updated 2 years ago
- Code for the paper "REV: Information-Theoretic Evaluation of Free-Text Rationales"☆15Updated last year
- Official Code for EMNLP2023 Main Conference paper: "KCTS: Knowledge-Constrained Tree Search Decoding with Token-Level Hallucination Detec…☆30Updated last year
- Probing for Labeled Dependency Trees (ACL 2022) + Sorting LMs by Structure (NAACL 2022)☆8Updated 7 months ago
- Code and test data for "On Measuring Bias in Sentence Encoders", to appear at NAACL 2019.☆54Updated 3 years ago
- Probing and Generalization of Metaphorical Knowledge in Pre-Trained Language Modelss[ACL 2022]☆21Updated 2 years ago
- ☆25Updated 11 months ago
- Easy-to-use MIRAGE code for faithful answer attribution in RAG applications. Paper: https://aclanthology.org/2024.emnlp-main.347/☆21Updated last month
- tianlu-wang / Identifying-and-Mitigating-Spurious-Correlations-for-Improving-Robustness-in-NLP-ModelsNAACL 2022 Findings☆15Updated 2 years ago
- Code repository for the paper "Mission: Impossible Language Models."☆42Updated 2 weeks ago
- ACL 2022: An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models.☆130Updated last month
- ☆13Updated 10 months ago
- [NAACL'25] Steering Knowledge Selection Behaviours in LLMs via SAE-Based Representation Engineering☆41Updated 2 months ago