d-doshi / Grokking
☆13Updated 2 weeks ago
Alternatives and similar repositories for Grokking:
Users who are interested in Grokking are comparing it to the libraries listed below:
- Deep Learning & Information Bottleneck☆58Updated last year
- Source code of "What can linearized neural networks actually say about generalization?"☆20Updated 3 years ago
- Omnigrok: Grokking Beyond Algorithmic Data☆53Updated 2 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- ModelDiff: A Framework for Comparing Learning Algorithms☆55Updated last year
- ☆41Updated 2 years ago
- PyTorch Datasets for Easy-To-Hard☆27Updated 2 months ago
- ☆14Updated last year
- Privacy backdoors☆51Updated 10 months ago
- Understanding Rare Spurious Correlations in Neural Network☆12Updated 2 years ago
- ☆49Updated last year
- ☆53Updated 2 years ago
- ☆17Updated 2 years ago
- ☆27Updated last year
- Official repository for our paper, Transformers Learn Higher-Order Optimization Methods for In-Context Learning: A Study with Linear Mode…☆15Updated 3 months ago
- ☆31Updated 5 months ago
- Spurious Features Everywhere - Large-Scale Detection of Harmful Spurious Features in ImageNet☆30Updated last year
- Replicating and dissecting the git-re-basin project in one-click-replication Colabs☆36Updated 2 years ago
- [NeurIPS 2023] and [ICLR 2024] for robustness certification.☆9Updated 3 months ago
- Official Repository for ICML 2023 paper "Can Neural Network Memorization Be Localized?"☆17Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆35Updated 2 years ago
- Computationally friendly hyper-parameter search with DP-SGD☆24Updated 2 months ago
- ☆34Updated last year
- Code for the paper "Evading Black-box Classifiers Without Breaking Eggs" [SaTML 2024]☆20Updated 11 months ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆35Updated 2 years ago
- Deep Networks Grok All the Time and Here is Why☆29Updated 9 months ago
- ☆16Updated last year
- Code for the paper "The Journey, Not the Destination: How Data Guides Diffusion Models"☆22Updated last year
- Intriguing Properties of Data Attribution on Diffusion Models (ICLR 2024)☆28Updated last year