fjzzq2002 / pizza
Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"
☆15 Updated last year
Alternatives and similar repositories for pizza:
Users who are interested in pizza are comparing it to the repositories listed below:
- ☆12 Updated last year
- ☆31 Updated 6 months ago
- ☆22 Updated 3 months ago
- ☆25 Updated 2 years ago
- ☆34 Updated last year
- ☆84 Updated last year
- ☆45 Updated last year
- ☆28 Updated last month
- ☆64 Updated 2 years ago
- Sparse Autoencoder Training Library ☆49 Updated last week
- Parallelizing non-linear sequential models over the sequence length ☆51 Updated 3 months ago
- ☆92 Updated 3 months ago
- The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We… ☆46 Updated last year
- The Energy Transformer block, in JAX ☆57 Updated last year
- ☆18 Updated last year
- ☆94 Updated last year
- ☆26 Updated 2 years ago
- Self-Supervised Alignment with Mutual Information ☆18 Updated 11 months ago
- ☆80 Updated last year
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training ☆34 Updated last month
- [ICML2025 Spotlight] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models ☆22 Updated this week
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine… ☆36 Updated 2 years ago
- The accompanying code for "Transformer Feed-Forward Layers Are Key-Value Memories". Mor Geva, Roei Schuster, Jonathan Berant, and Omer Le… ☆91 Updated 3 years ago
- ☆27 Updated 9 months ago
- ☆25 Updated last year
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode… ☆16 Updated 5 months ago
- ☆14 Updated last year
- ☆67 Updated 5 months ago
- Maze datasets for investigating OOD behavior of ML systems ☆45 Updated 2 weeks ago
- Code for "Echo Chamber: RL Post-training Amplifies Behaviors Learned in Pretraining" ☆15 Updated 3 weeks ago