snu-mllab / Targeted-Cause-Discovery
Official implementation for "Targeted Cause Discovery with Data-Driven Learning"
☆20Updated 3 weeks ago
Related projects: ⓘ
- ☆25Updated 4 months ago
- Official implementation of MAIA, A Multimodal Automated Interpretability Agent☆56Updated last month
- Explorations into the proposal from the paper "Grokfast, Accelerated Grokking by Amplifying Slow Gradients"☆82Updated 3 weeks ago
- ☆73Updated 5 months ago
- Official implementation of Phi-Mamba. A MOHAWK-distilled model (Transformers to SSMs: Distilling Quadratic Knowledge to Subquadratic Mode…☆61Updated this week
- ☆42Updated 3 months ago
- The official repository for HyperZ⋅Z⋅W Operator Connects Slow-Fast Networks for Full Context Interaction.☆29Updated this week
- ☆47Updated 3 months ago
- PyTorch implementation of models from the Zamba2 series.☆63Updated last month
- Implementation of the general framework for AMIE, from the paper "Towards Conversational Diagnostic AI", out of Google Deepmind☆51Updated this week
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆44Updated 3 months ago
- Model Stock: All we need is just a few fine-tuned models☆75Updated 5 months ago
- Pytorch implementation of a simple way to enable (Stochastic) Frame Averaging for any network☆45Updated last month
- Code for reproducing our paper "Not All Language Model Features Are Linear"☆57Updated last week
- ☆20Updated 5 months ago
- Implementation of Infini-Transformer in Pytorch☆100Updated last month
- [ACL 2024 Findings & ICLR 2024 WS] An Evaluator VLM that is open-source, offers reproducible evaluation, and inexpensive to use. Specific…☆51Updated last week
- ☆61Updated 2 months ago
- Personal implementation of ASIF by Antonio Norelli☆23Updated 3 months ago
- Official implementation of FIND (NeurIPS '23) Function Interpretation Benchmark and Automated Interpretability Agents☆42Updated 5 months ago
- Towards Understanding the Mixture-of-Experts Layer in Deep Learning☆19Updated 9 months ago
- Repository for the paper Stream of Search: Learning to Search in Language☆70Updated last month
- Yet another random morning idea to be quickly tried and architecture shared if it works; to allow the transformer to pause for any amount…☆50Updated 10 months ago
- ☆48Updated 3 months ago
- ☆22Updated last week
- WIP☆76Updated last month
- ☆38Updated 8 months ago
- ☆36Updated last month
- Simple implementation of muP, based on Spectral Condition for Feature Learning. The implementation is SGD only, dont use it for Adam☆66Updated last month
- These papers will provide unique insightful concepts that will broaden your perspective on neural networks and deep learning☆44Updated last year