fdalvi / NeuroX
A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆106 Updated 2 years ago
Alternatives and similar repositories for NeuroX
Users who are interested in NeuroX are comparing it to the libraries listed below.
- A library for finding knowledge neurons in pretrained transformer models. ☆158 Updated 3 years ago
- Minimum Bayes Risk Decoding for Hugging Face Transformers ☆60 Updated last year
- diagNNose is a Python library that facilitates a broad set of tools for analysing hidden activations of neural models. ☆82 Updated 2 years ago
- The official code of LM-Debugger, an interactive tool for inspection and intervention in transformer-based language models. ☆180 Updated 3 years ago
- This repository accompanies our paper “Do Prompt-Based Models Really Understand the Meaning of Their Prompts?” ☆85 Updated 3 years ago
- ☆65 Updated 2 years ago
- A curated list of awesome datasets with human label variation (un-aggregated labels) in Natural Language Processing and Computer Vision, … ☆94 Updated last year
- This repository contains the code for "Self-Diagnosis and Self-Debiasing: A Proposal for Reducing Corpus-Based Bias in NLP". ☆88 Updated 4 years ago
- To analyze and remove gender bias in coreference resolution systems ☆79 Updated 6 months ago
- ☆90 Updated 3 years ago
- Repository collecting resources and best practices to improve experimental rigour in deep learning research. ☆27 Updated 2 years ago
- ☆97 Updated 3 years ago
- How Contextual are Contextualized Word Representations? ☆42 Updated 5 years ago
- Code associated with the paper "Entropy-based Attention Regularization Frees Unintended Bias Mitigation from Lists" ☆50 Updated 3 years ago
- Repository for research in the field of Responsible NLP at Meta. ☆202 Updated 6 months ago
- A Python package to run inference with HuggingFace language and vision-language checkpoints wrapping many convenient features. ☆28 Updated last year
- ☆55 Updated 2 years ago
- Code for the paper "Implicit Representations of Meaning in Neural Language Models" ☆55 Updated 2 years ago
- Transformer Grammars: Augmenting Transformer Language Models with Syntactic Inductive Biases at Scale, TACL (2022) ☆132 Updated 5 months ago
- StereoSet: Measuring stereotypical bias in pretrained language models ☆192 Updated 2 years ago
- Source code for CoNLL 2021 paper by Huebner et al. 2021 ☆20 Updated 2 years ago
- Data for evaluating gender bias in coreference resolution systems. ☆81 Updated 6 years ago
- Highlight errors in a bib file: missing URLs, capitalization protection, etc. ☆27 Updated last year
- Software for transferring pre-trained English models to foreign languages ☆19 Updated 2 years ago
- Materials for EACL2024 tutorial: Transformer-specific Interpretability ☆60 Updated last year
- Utilities for the HuggingFace transformers library ☆71 Updated 2 years ago
- Replication code for "With Little Power Comes Great Responsibility" ☆39 Updated 5 years ago
- PAIR.withgoogle.com and friends' work on interpretability methods ☆214 Updated this week
- ☆39 Updated 4 years ago
- Official code and model checkpoints for our EMNLP 2022 paper "RankGen - Improving Text Generation with Large Ranking Models" (https://arx… ☆138 Updated 2 years ago