d-doshi / Grokking
☆15 Updated 9 months ago
Alternatives and similar repositories for Grokking
Users interested in Grokking are comparing it to the repositories listed below
- ModelDiff: A Framework for Comparing Learning Algorithms ☆58 Updated 2 years ago
- Recycling diverse models ☆46 Updated 2 years ago
- Code for the ICLR 2020 Paper, "A Theory of Usable Information under Computational Constraints" ☆28 Updated 5 years ago
- ☆38 Updated last year
- ☆18 Updated 3 years ago
- Official repo of Progressive Data Expansion: data, code and evaluation ☆29 Updated 2 years ago
- Source code of "Hold me tight! Influence of discriminative features on deep network boundaries" ☆21 Updated 3 years ago
- Deep Learning & Information Bottleneck ☆62 Updated 2 years ago
- This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models. ☆72 Updated 2 years ago
- ☆38 Updated 4 years ago
- ☆36 Updated 3 years ago
- ☆96 Updated 3 years ago
- Code release for REPAIR: REnormalizing Permuted Activations for Interpolation Repair ☆51 Updated last year
- ☆24 Updated 4 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023] ☆43 Updated 2 years ago
- ☆19 Updated 3 years ago
- The official PyTorch implementation - Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from t… ☆83 Updated 3 years ago
- ☆55 Updated 5 years ago
- This repository contains the code of the distribution shift framework presented in A Fine-Grained Analysis on Distribution Shift (Wiles e… ☆84 Updated last month
- Code for paper "Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers" ☆17 Updated 2 years ago
- A centralized place for deep thinking code and experiments ☆87 Updated 2 years ago
- Dataset Interfaces: Diagnosing Model Failures Using Controllable Counterfactual Generation ☆45 Updated 2 years ago
- ☆26 Updated 3 years ago
- ☆46 Updated 2 years ago
- Deep Networks Grok All the Time and Here is Why ☆38 Updated last year
- Distilling Model Failures as Directions in Latent Space ☆47 Updated 2 years ago
- Code for NeurIPS'23 paper "A Bayesian Approach To Analysing Training Data Attribution In Deep Learning" ☆17 Updated last year
- ☆15 Updated 3 years ago
- Official code repository for Correct-N-Contrast ☆23 Updated 3 years ago
- Latest Weight Averaging (NeurIPS HITY 2022) ☆31 Updated 2 years ago