fdalvi / NeuroX
A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆97Updated last year
Related projects ⓘ
Alternatives and complementary repositories for NeuroX
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆55Updated 5 months ago
- Materials for EACL2024 tutorial: Transformer-specific Interpretability☆41Updated 7 months ago
- Simple-to-use scoring function for arbitrarily tokenized texts.☆31Updated 2 weeks ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆81Updated last year
- ☆87Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆76Updated 6 months ago
- ☆95Updated 2 years ago
- Utility for behavioral and representational analyses of Language Models☆121Updated 2 months ago
- Data for evaluating gender bias in coreference resolution systems.☆67Updated 5 years ago
- Measuring the Mixing of Contextual Information in the Transformer☆25Updated last year
- ☆57Updated 4 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆84Updated 2 years ago
- PAIR.withgoogle.com and friend's work on interpretability methods☆148Updated last week
- A library for finding knowledge neurons in pretrained transformer models.☆151Updated 2 years ago
- The Benchmark of Linguistic Minimal Pairs☆141Updated last year
- ☆23Updated 2 years ago
- Highlight errors in a bib file: missing URLs, capitalization protection, etc☆24Updated 5 months ago
- ☆77Updated 6 months ago
- Experiments and code to generate the GINC small-scale in-context learning dataset from "An Explanation for In-context Learning as Implici…☆94Updated last year
- ☆94Updated 6 months ago
- ☆20Updated 4 months ago
- ☆65Updated last year
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆25Updated last month
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆46Updated 2 years ago
- Utilities for the HuggingFace transformers library☆61Updated last year
- ☆37Updated 4 years ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le…☆84Updated 3 years ago
- ☆38Updated last year
- A Python Commonsense Knowledge Inference Toolkit☆60Updated 10 months ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models.☆172Updated 2 years ago