snu-mllab / Targeted-Cause-DiscoveryLinks
Official implementation for "Targeted Cause Discovery with Data-Driven Learning"
☆23Updated 9 months ago
Alternatives and similar repositories for Targeted-Cause-Discovery
Users that are interested in Targeted-Cause-Discovery are comparing it to the libraries listed below
Sorting:
- [ICLR 2025] Monet: Mixture of Monosemantic Experts for Transformers☆69Updated this week
- ☆81Updated last year
- ☆31Updated last year
- Code for "Accelerating Training with Neuron Interaction and Nowcasting Networks" [to appear at ICLR 2025]☆19Updated last month
- PyTorch library for Active Fine-Tuning☆80Updated 4 months ago
- Sparse and discrete interpretability tool for neural networks☆63Updated last year
- Mixture of A Million Experts☆46Updated 10 months ago
- ☆53Updated 8 months ago
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆75Updated 6 months ago
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆64Updated 9 months ago
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆49Updated 9 months ago
- Model Stock: All we need is just a few fine-tuned models☆117Updated 9 months ago
- ☆79Updated 10 months ago
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆54Updated last year
- ☆46Updated 7 months ago
- Recycling diverse models☆44Updated 2 years ago
- [NeurIPS 2024] Official implementation of the paper "MambaLRP: Explaining Selective State Space Sequence Models".☆40Updated 7 months ago
- HGRN2: Gated Linear RNNs with State Expansion☆55Updated 10 months ago
- Codebase the paper "The Remarkable Robustness of LLMs: Stages of Inference?"☆18Updated 2 weeks ago
- The simplest, fastest repository for training/finetuning medium-sized GPTs.☆134Updated last week
- We study toy models of skill learning.☆28Updated 5 months ago
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆80Updated 10 months ago
- Official code for the paper "Attention as a Hypernetwork"☆39Updated last year
- Universal Neurons in GPT2 Language Models☆29Updated last year
- ☆51Updated last year
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆52Updated last year
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆66Updated 9 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆82Updated this week
- One Initialization to Rule them All: Fine-tuning via Explained Variance Adaptation☆40Updated 8 months ago