technion-cs-nlp / Individual-Neurons-Pitfalls
☆10 · Updated 2 years ago
Alternatives and similar repositories for Individual-Neurons-Pitfalls
Users who are interested in Individual-Neurons-Pitfalls are comparing it to the libraries listed below.
- ☆45 · Updated last year
- Code for "Tracing Knowledge in Language Models Back to the Training Data" ☆38 · Updated 2 years ago
- Code for preprint: Summarizing Differences between Text Distributions with Natural Language ☆42 · Updated 2 years ago
- Rationales for Sequential Predictions ☆40 · Updated 3 years ago
- Code for paper "Leakage-Adjusted Simulatability: Can Models Generate Non-Trivial Explanations of Their Behavior in Natural Language?" ☆22 · Updated 4 years ago
- ☆42 · Updated 4 years ago
- Explaining neural decisions contrastively to alternative decisions. ☆25 · Updated 4 years ago
- A Kernel-Based View of Language Model Fine-Tuning https://arxiv.org/abs/2210.05643 ☆76 · Updated last year
- Code for the paper "Implicit Representations of Meaning in Neural Language Models" ☆54 · Updated 2 years ago
- ☆24 · Updated 4 years ago
- Landing page for MIB: A Mechanistic Interpretability Benchmark ☆16 · Updated last week
- This repository accompanies our paper "Do Prompt-Based Models Really Understand the Meaning of Their Prompts?" ☆85 · Updated 3 years ago
- [EMNLP 2021] Dataset and PyTorch Code for ExplaGraphs: An Explanation Graph Generation Task for Structured Commonsense Reasoning ☆12 · Updated 2 years ago
- ☆22 · Updated 2 years ago
- Query-focused summarization data ☆42 · Updated 2 years ago
- Data and code for our paper "Exploring and Predicting Transferability across NLP Tasks", to appear at EMNLP 2020. ☆50 · Updated 4 years ago
- Repo for ICML23 "Why do Nearest Neighbor Language Models Work?" ☆58 · Updated 2 years ago
- In-context Example Selection with Influences ☆15 · Updated 2 years ago
- This is the official implementation for the paper "Learning to Scaffold: Optimizing Model Explanations for Teaching" ☆19 · Updated 3 years ago
- ☆38 · Updated last year
- PyTorch code for the RetoMaton paper: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022) ☆73 · Updated 3 years ago
- Debiasing Methods in Natural Language Understanding Make Bias More Accessible: Code and Data ☆14 · Updated 3 years ago
- ☆54 · Updated 2 years ago
- Code for EMNLP 2021 paper "Measuring Association Between Labels and Free-Text Rationales" ☆12 · Updated last year
- ☆22 · Updated 3 years ago
- Code for NAACL 2022 paper "Reframing Human-AI Collaboration for Generating Free-Text Explanations" ☆31 · Updated 2 years ago
- ☆58 · Updated 3 years ago
- ☆89 · Updated 2 months ago
- Automatic metrics for GEM tasks ☆66 · Updated 2 years ago
- DEMix Layers for Modular Language Modeling ☆53 · Updated 3 years ago