fdalvi / NeuroX
A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆106 Updated 2 years ago
Alternatives and similar repositories for NeuroX
Users who are interested in NeuroX are comparing it to the libraries listed below.
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models. ☆179 Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆60 Updated last year
- A library for finding knowledge neurons in pretrained transformer models. ☆159 Updated 3 years ago
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models. ☆82 Updated 2 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 Updated 3 years ago
- ☆65 Updated 2 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists" ☆50 Updated 3 years ago
- ☆97 Updated 3 years ago
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022) ☆130 Updated 4 months ago
- Measuring the Mixing of Contextual Information in the Transformer ☆31 Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, … ☆93 Updated last year
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP". ☆88 Updated 4 years ago
- ☆55 Updated 2 years ago
- M2D2: A Massively Multi-domain Language Modeling Dataset (EMNLP 2022) by Machel Reid, Victor Zhong, Suchin Gururangan, Luke Zettlemoyer ☆54 Updated 2 years ago
- Code of NAACL 2022 "Efficient Hierarchical Domain Adaptation for Pretrained Language Models" paper. ☆32 Updated 2 years ago
- ☆39 Updated 4 years ago
- To analyze and remove gender bias in coreference resolution systems ☆79 Updated 5 months ago
- ☆90 Updated 3 years ago
- Interpreting Language Models with Contrastive Explanations (EMNLP 2022 Best Paper Honorable Mention) ☆62 Updated 3 years ago
- A software for transferring pre-trained English models to foreign languages ☆19 Updated 2 years ago
- Query-focused summarization data ☆42 Updated 2 years ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark ☆20 Updated 2 months ago
- Code repository for the paper "Mission: Impossible Language Models." ☆54 Updated last month
- ☆46 Updated last year
- ☆87 Updated last year
- Replication code for "With Little Power Comes Great Responsibility" ☆39 Updated 5 years ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le… ☆97 Updated 4 years ago
- Data for evaluating gender bias in coreference resolution systems. ☆80 Updated 6 years ago
- A python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features. ☆28 Updated last year
- Repository collecting resources and best practices to improve experimental rigour in deep learning research. ☆27 Updated 2 years ago