d-doshi / GrokkingLinks

☆14

Alternatives and similar repositories for Grokking

Users that are interested in Grokking are comparing it to the libraries listed below

Sorting:

MadryLab / modeldiff
ModelDiff: A Framework for Comparing Learning Algorithms
☆59Updated last year
uclaml / PDE
Official repo of Progressive Data Expansion: data, code and evaluation
☆29Updated last year
xu-ji / information-bottleneck
Deep Learning & Information Bottleneck
☆61Updated 2 years ago
facebookresearch / ModelRatatouille
Recycling diverse models
☆45Updated 2 years ago
oripress / EntropyEnigma
Official code for the ICML 2024 paper "The Entropy Enigma: Success and Failure of Entropy Minimization"
☆53Updated last year
AhmedImtiazPrio / grok-adversarial
Deep Networks Grok All the Time and Here is Why
☆37Updated last year
google-deepmind / ssl_hsic
☆37Updated last year
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆37Updated 2 years ago
JeanKaddour / LAWA
Latest Weight Averaging (NeurIPS HITY 2022)
☆31Updated 2 years ago
JonasGeiping / dataaugs
☆18Updated 2 years ago
sayakpaul / robustness-foundation-models
This repository holds code and other relevant files for the NeurIPS 2022 tutorial: Foundational Robustness of Foundation Models.
☆71Updated 2 years ago
matchten / LoRA-Models-for-SAEs
Code for reproducing our paper "Low Rank Adapting Models for Sparse Autoencoder Features"
☆12Updated 4 months ago
AllanYangZhou / generative-invariance-transfer
☆26Updated 3 years ago
tml-epfl / sharpness-vs-generalization
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆43Updated last year
ethz-spylab / superhuman-ai-consistency
☆29Updated 2 years ago
AvivNavon / DWSNets
Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]
☆89Updated last year
somepago / dbViz
The official PyTorch implementation - Can Neural Nets Learn the Same Model Twice? Investigating Reproducibility and Double Descent from t…
☆80Updated 3 years ago
facebookresearch / DejaVu
Repository for the paper Do SSL Models Have Déjà Vu? A Case of Unintended Memorization in Self-supervised Learning
☆36Updated 2 years ago
alexrame / diwa
DiWA: Diverse Weight Averaging for Out-of-Distribution Generalization
☆31Updated 2 years ago
deel-ai / Craft
👋 Code for : "CRAFT: Concept Recursive Activation FacTorization for Explainability" (CVPR 2023)
☆66Updated 2 years ago
locuslab / T-MARS
Code for T-MARS data filtering
☆35Updated last year
ShanglunFengatETHZ / PrivacyBackdoor
Privacy backdoors
☆51Updated last year
gregorbachmann / scaling_mlps
☆51Updated last year
MadryLab / bias-transfer
☆15Updated 3 years ago
Newbeeer / V-information
Code for the ICLR 2020 Paper, "A Theory of Usable Information under Computational Constraints"
☆26Updated 5 years ago
erosenfeld / disagree_discrep
Provably (and non-vacuously) bounding test error of deep neural networks under distribution shift with unlabeled test data.
☆10Updated last year
MadryLab / DebuggableDeepNetworks
☆38Updated 4 years ago
KaidiXu / LiRPA_Verify
Code for paper "Fast and Complete: Enabling Complete Neural Network Verification with Rapid and Massively Parallel Incomplete Verifiers"
☆17Updated 2 years ago
katiekang1998 / reasoning_generalization
☆34Updated 7 months ago
aks2203 / deep-thinking
A centralized place for deep thinking code and experiments
☆85Updated 2 years ago