fdalvi / NeuroXView external linksLinks
A Python library that encapsulates various methods for neuron interpretation and analysis in Deep NLP models.
☆106Oct 4, 2023Updated 2 years ago
Alternatives and similar repositories for NeuroX
Users that are interested in NeuroX are comparing it to the libraries listed below
Sorting:
- Mechanistic Interpretability for Transformer Models☆53Jun 1, 2022Updated 3 years ago
- Explicit Alignment Objectives for Multilingual Bidirectional Encoders☆14Apr 14, 2021Updated 4 years ago
- Making a bridge between NLP models and Brain data☆18Jun 3, 2020Updated 5 years ago
- CausalGym: Benchmarking causal interpretability methods on linguistic tasks☆51Nov 30, 2024Updated last year
- ☆38Apr 23, 2019Updated 6 years ago
- ☆15Apr 10, 2018Updated 7 years ago
- Overview of corpora/datasets for Germanic low-resource languages and dialects. Accompanies "A Survey of Corpora for Germanic Low-Resource…☆27Updated this week
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching"☆19May 19, 2022Updated 3 years ago
- A framework for evaluating Machine Translation models.☆12May 26, 2025Updated 8 months ago
- [ACL 2023] Counterspeeches up my sleeve! Intent Distribution Learning and Persistent Fusion for Intent-Conditioned Counterspeech Generati…☆10Sep 23, 2023Updated 2 years ago
- [ICML 2023] Variational Curriculum Reinforcement Learning for Unsupervised Discovery of Skills☆12Jul 15, 2023Updated 2 years ago
- Code for the NAACL 2024 HCI+NLP Workshop paper "LLMCheckup: Conversational Examination of Large Language Models via Interpretability Tool…☆13Mar 24, 2024Updated last year
- Code for processing brain data☆12Apr 5, 2019Updated 6 years ago
- Keeping language models honest by directly eliciting knowledge encoded in their activations.☆217Jan 26, 2026Updated 2 weeks ago
- A multi-species, multi-ephys format histology processing and probe alignment pipeline☆13Mar 20, 2024Updated last year
- A Dataset and Results for Classifying Emotions Across Languages☆10Jun 20, 2021Updated 4 years ago
- Applies ROME and MEMIT on Mamba-S4 models☆14Apr 5, 2024Updated last year
- ☆11Dec 1, 2020Updated 5 years ago
- Code to reproduce key results accompanying "SAEs (usually) Transfer Between Base and Chat Models"☆13Jul 18, 2024Updated last year
- ☆15Apr 20, 2018Updated 7 years ago
- Dual optimization to learn laplacian eigenpairs in arbitrary spaces☆16Dec 18, 2024Updated last year
- A2T: Towards Improving Adversarial Training of NLP Models (EMNLP 2021 Findings)☆26Sep 12, 2021Updated 4 years ago
- PHP low-level client for Vespa. https://vespa.ai/☆17Jan 22, 2026Updated 3 weeks ago
- DreamGaussian with 2D-GS☆12Oct 10, 2024Updated last year
- ☆10Jul 27, 2018Updated 7 years ago
- Code for Analyzing Redundancy in Pretrained Transformer Models accepted at EMNLP 2020☆14Oct 6, 2020Updated 5 years ago
- A quick way to get started with Transformer Lens☆14Dec 13, 2023Updated 2 years ago
- Repository describing example random control tasks for designing and interpreting neural probes☆32Jun 21, 2022Updated 3 years ago
- ☆15Apr 2, 2025Updated 10 months ago
- Getting interpretable dimensions in word embedding spaces.☆15Jul 6, 2023Updated 2 years ago
- Debiasing Methods in Natural Language Understanding Make Bias More Accessible: Code and Data☆14Apr 24, 2022Updated 3 years ago
- This is the code for our ACL 2021 paper entitled eMLM: A New Pre-training Objective for Emotion Related Tasks☆15Sep 7, 2022Updated 3 years ago
- ☆15Jul 1, 2020Updated 5 years ago
- Official code implementation for the paper "Do Vision & Language Decoders use Images and Text equally? How Self-consistent are their Expl…☆12Apr 4, 2025Updated 10 months ago
- ☆19Sep 16, 2025Updated 4 months ago
- Emotion-Aware Dialogue Response Generation by Multi-Task Learning☆13Jan 22, 2022Updated 4 years ago
- ☆207Oct 14, 2025Updated 3 months ago
- Open source replication of Anthropic's Crosscoders for Model Diffing☆64Oct 27, 2024Updated last year
- (NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations☆15Apr 14, 2025Updated 9 months ago