Compositional Explanations of Neurons, NeurIPS 2020 https://arxiv.org/abs/2006.14032
☆25 · Apr 9, 2021 · Updated 4 years ago
Alternatives and similar repositories for compexp
Users that are interested in compexp are comparing it to the libraries listed below
- ☆10 · Jul 24, 2023 · Updated 2 years ago
- Codebase for Mechanistic Mode Connectivity · ☆13 · Jul 14, 2023 · Updated 2 years ago
- Code for Net2Vec: Quantifying and Explaining how Concepts are Encoded by Filters in Deep Neural Networks · ☆30 · Feb 8, 2018 · Updated 8 years ago
- Experiments from "The Generalization-Stability Tradeoff in Neural Network Pruning": https://arxiv.org/abs/1906.03728 · ☆14 · Oct 23, 2020 · Updated 5 years ago
- Code for Evaluating Explanations for Reading Comprehension with Realistic Counterfactuals. · ☆18 · Apr 25, 2021 · Updated 4 years ago
- Code for the paper "A Sea of Words: An In-Depth Analysis of Anchors for Text Data", AISTATS 2023 · ☆14 · Oct 26, 2024 · Updated last year
- Explanation Optimization · ☆13 · Oct 16, 2020 · Updated 5 years ago
- Scalable Automatic Visual Summarization of Concepts in Deep Neural Networks · ☆18 · May 11, 2022 · Updated 3 years ago
- Code of LeCoRE · ☆13 · Feb 15, 2023 · Updated 3 years ago
- Entity Evaluation code · ☆21 · Nov 6, 2019 · Updated 6 years ago
- Python library for analyzing the internal structure of deep neural networks. · ☆19 · Feb 9, 2026 · Updated last month
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents · ☆52 · Sep 24, 2024 · Updated last year
- Code for ModularQA · ☆28 · Jun 8, 2021 · Updated 4 years ago
- Code for paper "When Can Models Learn From Explanations? A Formal Framework for Understanding the Roles of Explanation Data" · ☆14 · Feb 16, 2021 · Updated 5 years ago
- ☆13 · Jul 26, 2023 · Updated 2 years ago
- Efficient and Effective Weight-Ensembling Mixture of Experts for Multi-Task Model Merging. Arxiv, 2024. · ☆16 · Oct 28, 2024 · Updated last year
- ☆16 · Aug 7, 2024 · Updated last year
- Emergent Communication of Generalizations, NeurIPS 2021 · ☆13 · Sep 29, 2021 · Updated 4 years ago
- Provable Worst Case Guarantees for the Detection of Out-of-Distribution Data · ☆13 · Sep 20, 2022 · Updated 3 years ago
- Causal Reasoning for Membership Inference Attacks · ☆11 · Oct 21, 2022 · Updated 3 years ago
- ☆20 · Oct 12, 2021 · Updated 4 years ago
- ☆13 · Sep 27, 2022 · Updated 3 years ago
- Official codes for NAACL 2025 paper "LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias … · ☆11 · Nov 25, 2025 · Updated 3 months ago
- Light version of Network Dissection for Quantifying Interpretability of Networks · ☆221 · May 6, 2019 · Updated 6 years ago
- ☆19 · Oct 27, 2021 · Updated 4 years ago
- ☆96 · Oct 27, 2022 · Updated 3 years ago
- ☆11 · Mar 26, 2020 · Updated 5 years ago
- This is the implementation of paper "Learning to Ask Conversational Questions by Optimizing Levenshtein Distance". · ☆10 · Jul 5, 2021 · Updated 4 years ago
- Official PyTorch implementation for "Understanding Instance-based Interpretability of Variational Auto-Encoders." · ☆13 · Oct 21, 2021 · Updated 4 years ago
- ☆15 · Mar 12, 2024 · Updated 2 years ago
- Debiasing Methods in Natural Language Understanding Make Bias More Accessible: Code and Data · ☆14 · Apr 24, 2022 · Updated 3 years ago
- ☆13 · Feb 14, 2022 · Updated 4 years ago
- Python wrapper for the FrameNet library. · ☆24 · Jul 26, 2011 · Updated 14 years ago
- NILE : Natural Language Inference with Faithful Natural Language Explanations · ☆29 · Jun 12, 2023 · Updated 2 years ago
- Dataset & Code for Com2Sense Benchmark · ☆13 · Sep 8, 2021 · Updated 4 years ago
- An implementation of the Llama architecture, to instruct and delight · ☆21 · May 31, 2025 · Updated 9 months ago
- ML Benchmarks in Algebraic Combinatorics · ☆24 · Jan 15, 2026 · Updated 2 months ago
- Infinite relational model (IRM) for datamicroscopes · ☆14 · Oct 26, 2015 · Updated 10 years ago
- The KiloGram Tangrams dataset · ☆58 · Apr 25, 2025 · Updated 10 months ago