d-doshi / GrokkingLinks
☆14Updated 5 months ago
Alternatives and similar repositories for Grokking
Users that are interested in Grokking are comparing it to the libraries listed below
Sorting:
- ModelDiff: A Framework for Comparing Learning Algorithms☆59Updated last year
- Official repo of Progressive Data Expansion: data, code and evaluation☆29Updated last year
- Deep Learning & Information Bottleneck☆61Updated 2 years ago
- Recycling diverse models☆45Updated 2 years ago
- Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"☆53Updated last year
- Deep Networks Grok All the Time and Here is Why☆37Updated last year
- ☆37Updated last year
- This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…☆37Updated 2 years ago
- Latest Weight Averaging (NeurIPS HITY 2022)☆31Updated 2 years ago
- ☆18Updated 2 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.☆71Updated 2 years ago
- Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"☆12Updated 4 months ago
- ☆26Updated 3 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- ☆29Updated 2 years ago
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]☆89Updated last year
- The official PyTorch implementation - Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from t…☆80Updated 3 years ago
- Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning☆36Updated 2 years ago
- DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization☆31Updated 2 years ago
- 👋 Code for : "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023)☆66Updated 2 years ago
- Code for T-MARS data filtering☆35Updated last year
- Privacy backdoors☆51Updated last year
- ☆51Updated last year
- ☆15Updated 3 years ago
- Code for the ICLR 2020 Paper, "A Theory of Usable Information under Computational Constraints"☆26Updated 5 years ago
- Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.☆10Updated last year
- ☆38Updated 4 years ago
- Code for paper "Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers"☆17Updated 2 years ago
- ☆34Updated 7 months ago
- A centralized place for deep thinking code and experiments☆85Updated 2 years ago