A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆108Oct 4, 2023Updated 2 years ago
Alternatives and similar repositories for NeuroX
Users that are interested in NeuroX are comparing it to the libraries listed below. We may earn a commission when you buy through links labeled 'Ad' on this page.
Sorting:
- This is a repository with the code for the EMNLP 2020 paper "Information-Theoretic Probing with Minimum Description Length"☆71Aug 20, 2024Updated last year
- Making a bridge between NLP models and Brain data☆19Jun 3, 2020Updated 5 years ago
- Mechanistic Interpretability for Transformer Models☆53Jun 1, 2022Updated 3 years ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020☆14Oct 6, 2020Updated 5 years ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆51Nov 30, 2024Updated last year
- Managed hosting for WordPress and PHP on Cloudways • AdManaged hosting for WordPress, Magento, Laravel, or PHP apps, on multiple cloud providers. Deploy in minutes on Cloudways by DigitalOcean.
- EMNLP 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer☆13Mar 23, 2025Updated last year
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills☆12Jul 15, 2023Updated 2 years ago
- Explicit Alignment Objectives for Multilingual Bidirectional Encoders☆14Apr 14, 2021Updated 5 years ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"☆20May 19, 2022Updated 3 years ago
- Teaching Models to Express Their Uncertainty in Words☆38May 26, 2022Updated 3 years ago
- Repository describing example random control tasks for designing and interpreting neural probes☆32Jun 21, 2022Updated 3 years ago
- This code accompanies the paper "Information-Theoretic Probing for Linguistic Structure" published in ACL 2020.☆21Apr 27, 2020Updated 5 years ago
- ☆31Mar 4, 2024Updated 2 years ago
- Natural Language Processing Research in North American Linguistics Departments☆22Nov 13, 2025Updated 5 months ago
- Serverless GPU API endpoints on Runpod - Bonus Credits • AdSkip the infrastructure headaches. Auto-scaling, pay-as-you-go, no-ops approach lets you focus on innovating your application.
- Stanford NLP Python library for understanding and improving PyTorch models via interventions☆870Mar 6, 2026Updated last month
- Steering Vector Repo from "Extracting Latent Steering Vectors from Pretrained Language Models" - ACL2022 Findings