goldblum / free-lunch
Implementation of experiments from The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning
☆16Updated last year
Related projects ⓘ
Alternatives and complementary repositories for free-lunch
- ☆22Updated last year
- Source code of "What can linearized neural networks actually say about generalization?☆18Updated 3 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆42Updated last year
- Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)☆79Updated last year
- NanoGPT-like codebase for LLM training☆73Updated this week
- ☆26Updated 2 weeks ago
- Deep Learning & Information Bottleneck☆50Updated last year
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆11Updated 10 months ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆16Updated last year
- ☆43Updated 9 months ago
- ☆31Updated last year
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆85Updated last year
- Data for "Datamodels: Predicting Predictions with Training Data"☆90Updated last year
- ☆59Updated 2 years ago
- ☆61Updated 2 years ago
- ☆50Updated 5 months ago
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆19Updated 11 months ago
- Investigating the generalization behavior of LM probes trained to predict truth labels: (1) from one annotator to another, and (2) from e…☆25Updated 5 months ago
- ☆59Updated 3 years ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆40Updated 6 months ago
- ☆49Updated last year
- ☆13Updated 2 months ago
- ☆25Updated 4 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆28Updated 10 months ago
- Influence Functions with (Eigenvalue-corrected) Kronecker-Factored Approximate Curvature☆100Updated 3 months ago
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair☆45Updated 9 months ago
- ☆23Updated 2 years ago
- ModelDiff: A Framework for Comparing Learning Algorithms☆53Updated last year
- Privacy backdoors☆47Updated 6 months ago
- Influence Analysis and Estimation - Survey, Papers, and Taxonomy☆63Updated 8 months ago