RylanSchaeffer / Stanford-AI-Alignment-Double-Descent-TutorialLinks
Code for Arxiv Double Descent Demystified: Identifying, Interpreting & Ablating the Sources of a Deep Learning Puzzle
☆26Updated last year
Alternatives and similar repositories for Stanford-AI-Alignment-Double-Descent-Tutorial
Users that are interested in Stanford-AI-Alignment-Double-Descent-Tutorial are comparing it to the libraries listed below
Sorting:
- ☆26Updated 2 years ago
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆37Updated 2 years ago
- ☆29Updated last year
- Deep Networks Grok All the Time and Here is Why☆36Updated last year
- ☆16Updated last year
- Unofficial implementation of Conformal Language Modeling by Quach et al☆28Updated last year
- ☆18Updated 3 months ago
- Repo for solving arc problems with an Neural Cellular Automata☆15Updated 2 weeks ago
- Understanding how features learned by neural networks evolve throughout training☆34Updated 7 months ago
- Attribution-based Parameter Decomposition☆23Updated last week
- ModelDiff: A Framework for Comparing Learning Algorithms☆56Updated last year
- Code repository for "The Clock and the Pizza: Two Stories in Mechanistic Explanation of Neural Networks"☆16Updated last year
- we got you bro☆35Updated 10 months ago
- ☆28Updated 3 months ago
- ☆18Updated 2 years ago
- ☆66Updated 2 years ago
- Official repo of Progressive Data Expansion: data, code and evaluation☆29Updated last year
- Recycling diverse models☆44Updated 2 years ago
- ☆53Updated 8 months ago
- gzip Predicts Data-dependent Scaling Laws☆35Updated last year
- Code for the paper "Data Feedback Loops: Model-driven Amplification of Dataset Biases"☆16Updated 2 years ago
- Implementation of Influence Function approximations for differently sized ML models, using PyTorch☆15Updated last year
- Implementation for robust ViT and scaled attention☆19Updated 2 months ago
- Personal implementation of ASIF by Antonio Norelli☆25Updated last year
- This is the official repository for the "Towards Vision-Language Mechanistic Interpretability: A Causal Tracing Tool for BLIP" paper acce…☆22Updated last year
- Codebase for Mechanistic Mode Connectivity☆14Updated last year
- Code for our paper "Decomposing The Dark Matter of Sparse Autoencoders"☆22Updated 4 months ago
- Sparse Autoencoder Training Library☆52Updated last month
- Omnigrok: Grokking Beyond Algorithmic Data☆58Updated 2 years ago
- Code for experiments on transformers using Markovian data.☆14Updated 6 months ago