fdalvi / NeuroX
A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆100Updated last year
Alternatives and similar repositories for NeuroX:
Users that are interested in NeuroX are comparing it to the libraries listed below
- Simple-to-use scoring function for arbitrarily tokenized texts.☆38Updated last week
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models.☆82Updated last year
- Materials for EACL2024 tutorial: Transformer-specific Interpretability☆44Updated 11 months ago
- ☆89Updated 2 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers☆56Updated 9 months ago
- To analyze and remove gender bias in coreference resolution systems☆76Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?”☆85Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, …☆80Updated 10 months ago
- Utility for behavioral and representational analyses of Language Models☆128Updated last week
- How Contextual are Contextualized Word Representations?☆41Updated 4 years ago
- ☆29Updated 8 months ago
- Evaluation pipeline for the BabyLM Challenge 2023.☆75Updated last year
- Source code for CoNLL 2021 paper by Huebner et al. 2021☆17Updated last year
- Are foundation LMs multilingual knowledge bases? (EMNLP 2023)☆19Updated last year
- How do transformer LMs encode relations?☆46Updated last year
- ☆65Updated last year
- ☆38Updated last year
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features.☆26Updated 5 months ago
- Rationales for Sequential Predictions☆40Updated 2 years ago
- A library for finding knowledge neurons in pretrained transformer models.☆154Updated 3 years ago
- Code repository for the paper "Mission: Impossible Language Models."☆47Updated last week
- ☆39Updated 3 years ago
- Code for the paper "Implicit Representations of Meaning in Neural Language Models"☆51Updated 2 years ago
- ☆96Updated 2 years ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention)☆62Updated 2 years ago
- Glot500: Scaling Multilingual Corpora and Language Models to 500 Languages -- ACL 2023☆99Updated 10 months ago
- ☆203Updated last week
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists"☆48Updated 2 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer☆55Updated 2 years ago
- Measuring the Mixing of Contextual Information in the Transformer☆28Updated last year