fjzzq2002 / pizza

Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"

☆17

Alternatives and similar repositories for pizza

Users who are interested in pizza are comparing it to the repositories listed below.

bilal-chughtai / rep-theory-mech-interp
☆26 · Updated 2 years ago
SamsungSAILMontreal / ghn3
Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]
☆36 · Updated 10 months ago
IdoAmos / not-from-scratch
☆32 · Updated 8 months ago
xu-ji / information-bottleneck
Deep Learning & Information Bottleneck
☆61 · Updated 2 years ago
KindXiaoming / Omnigrok
Omnigrok: Grokking Beyond Algorithmic Data
☆58 · Updated 2 years ago
MadryLab / modeldiff
ModelDiff: A Framework for Comparing Learning Algorithms
☆59 · Updated last year
snap-stanford / zeroc
ZeroC is a neuro-symbolic method that, trained with elementary visual concepts and relations, can zero-shot recognize and acquire more com…
☆32 · Updated 2 years ago
mechanistic-interpretability-grokking / progress-measures-paper
☆68 · Updated 2 years ago
epfml / schedules-and-scaling
Code for NeurIPS 2024 Spotlight: "Scaling Laws and Compute-Optimal Training Beyond Fixed Training Durations"
☆75 · Updated 8 months ago
gregorbachmann / Next-Token-Failures
☆87 · Updated last year
locuslab / edge-of-stability
☆70 · Updated 7 months ago
abhishekpanigrahi1996 / transformer_in_transformer
☆45 · Updated last year
Silent-Zebra / twisted-smc-lm
☆29 · Updated 3 months ago
automl / is_mamba_capable_of_icl
☆18 · Updated last year
GFNOrg / GFN_vs_HVI
☆9 · Updated 2 years ago
bhoov / energy-transformer-jax
The Energy Transformer block, in JAX
☆59 · Updated last year
deep-symbolic-mathematics / llm-srbench
[ICML2025 Oral] LLM-SRBench: A New Benchmark for Scientific Equation Discovery with Large Language Models
☆52 · Updated last month
hartvigsen-group / composable-interventions
☆28 · Updated 4 months ago
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59 · Updated last year
Johswald / awesome-hypernetworks
☆65 · Updated 3 years ago
Sea-Snell / grokking
Unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆77 · Updated 3 years ago
RobertCsordas / modules
The official repository for our paper "Are Neural Nets Modular? Inspecting Functional Modularity Through Differentiable Weight Masks". We…
☆46 · Updated last year
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63 · Updated last year
ejmichaud / grokking-squared
☆26 · Updated 2 years ago
KihoPark / linear_rep_geometry
☆100 · Updated 5 months ago
vedantpalit / Towards-Vision-Language-Mechanistic-Interpretability
This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…
☆22 · Updated last year
chenweize1998 / fully-hyperbolic-nn
Code for the paper "Fully Hyperbolic Neural Networks"
☆79 · Updated 2 years ago
jysohn1108 / Looped-Transformer
Official implementation of the transformer (TF) architecture suggested in a paper entitled "Looped Transformers as Programmable Computers…
☆27 · Updated 2 years ago
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆37 · Updated 2 years ago
adamkarvonen / SAE_BoardGameEval
☆23 · Updated 5 months ago