A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆108 · Oct 4, 2023 · Updated 2 years ago
Alternatives and similar repositories for NeuroX
Users who are interested in NeuroX are comparing it to the libraries listed below.
- Analyzing Latent Concept in Pre-trained Transformer Models ☆12 · Jul 18, 2022 · Updated 3 years ago
- This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length" ☆72 · Aug 20, 2024 · Updated last year
- Making a bridge between NLP models and Brain data ☆19 · Jun 3, 2020 · Updated 5 years ago
- Mechanistic Interpretability for Transformer Models ☆53 · Jun 1, 2022 · Updated 3 years ago
- ☆38 · Apr 23, 2019 · Updated 7 years ago
- [Kauf & Ivanova, ACL 2023] A Better Way to Do Masked Language Model Scoring ☆12 · Dec 1, 2023 · Updated 2 years ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020 ☆14 · Oct 6, 2020 · Updated 5 years ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks ☆53 · Nov 30, 2024 · Updated last year
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer ☆13 · Mar 23, 2025 · Updated last year
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills ☆12 · Jul 15, 2023 · Updated 2 years ago
- Explicit Alignment Objectives for Multilingual Bidirectional Encoders ☆14 · Apr 14, 2021 · Updated 5 years ago
- Teaching Models to Express Their Uncertainty in Words ☆38 · May 26, 2022 · Updated 3 years ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e… ☆30 · May 23, 2024 · Updated last year
- Repository describing example random control tasks for designing and interpreting neural probes ☆32 · Jun 21, 2022 · Updated 3 years ago
- This code accompanies the paper "Information-Theoretic Probing for Linguistic Structure" published in ACL 2020. ☆21 · Apr 27, 2020 · Updated 6 years ago
- ☆31 · Mar 4, 2024 · Updated 2 years ago
- Natural Language Processing Research in North American Linguistics Departments ☆22 · Nov 13, 2025 · Updated 5 months ago
- ☆22 · Sep 25, 2023 · Updated 2 years ago
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings ☆11 · Mar 14, 2022 · Updated 4 years ago
- Python Finite-State Toolkit ☆67 · Apr 1, 2026 · Updated last month
- ☆15 · Apr 20, 2018 · Updated 8 years ago
- Emotion-Aware Dialogue Response Generation by Multi-Task Learning ☆13 · Jan 22, 2022 · Updated 4 years ago
- Fractional White Noises for Neural Stochastic Differential Equations (NeurIPS 2022) ☆16 · Nov 17, 2022 · Updated 3 years ago
- ☆17 · Dec 11, 2024 · Updated last year
- Tools for understanding how transformer predictions are built layer-by-layer ☆586 · Aug 7, 2025 · Updated 9 months ago
- A framework for evaluating Machine Translation models. ☆12 · Apr 21, 2026 · Updated 2 weeks ago
- ☆35 · Jun 13, 2025 · Updated 10 months ago
- Efficient Scaling laws and collaborative pretraining. ☆22 · Sep 18, 2025 · Updated 7 months ago
- Code for processing brain data ☆12 · Apr 5, 2019 · Updated 7 years ago
- Codebase implementing LMs for learning the Dyck-(k,m) bounded hierarchical language ☆16 · Oct 11, 2020 · Updated 5 years ago
- Code for the paper "Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias" ☆81 · Aug 25, 2021 · Updated 4 years ago
- ☆13 · Dec 11, 2021 · Updated 4 years ago
- Model zoo for different kinds of uncertainty quantification methods used in Natural Language Processing, implemented in PyTorch. ☆55 · May 5, 2023 · Updated 3 years ago
- ☆15 · Apr 10, 2018 · Updated 8 years ago
- ☆17 · May 25, 2020 · Updated 5 years ago
- ☆11 · Nov 20, 2020 · Updated 5 years ago
- ☆30 · Sep 3, 2025 · Updated 8 months ago
- ☆15 · Jul 1, 2020 · Updated 5 years ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark ☆24 · Aug 15, 2025 · Updated 8 months ago