d-doshi / Grokking
☆11Updated 4 months ago
Alternatives and similar repositories for Grokking:
Users who are interested in Grokking are comparing it to the repositories listed below
- Pytorch Datasets for Easy-To-Hard☆26Updated last week
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Updated last year
- A centralized place for deep thinking code and experiments☆78Updated last year
- ☆30Updated 3 months ago
- Computationally friendly hyper-parameter search with DP-SGD☆23Updated last week
- ☆17Updated 2 years ago
- ☆38Updated 3 years ago
- Deep Learning & Information Bottleneck☆53Updated last year
- ☆55Updated 4 years ago
- ☆51Updated last year
- The official repository of the paper "On the Exploitability of Instruction Tuning".☆58Updated 11 months ago
- ☆16Updated last year
- ☆39Updated last year
- Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)☆13Updated 2 years ago
- Implementation of experiments from The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning☆16Updated last year
- Source code of "What can linearized neural networks actually say about generalization?"☆19Updated 3 years ago
- ☆33Updated last year
- ☆27Updated 6 months ago
- ☆30Updated last month
- ☆13Updated 10 months ago
- ☆58Updated 3 years ago
- [NeurIPS 2023] and [ICLR 2024] for robustness certification.☆9Updated last month
- Data for "Datamodels: Predicting Predictions with Training Data"☆94Updated last year
- ☆58Updated 2 years ago
- This repository contains the code used for the experiments in the paper "Fine-Tuning Enhances Existing Mechanisms: A Case Study on Entity…☆21Updated 9 months ago
- Starter kit and data loading code for the Trojan Detection Challenge NeurIPS 2022 competition☆33Updated last year
- ☆15Updated 10 months ago
- Independent robustness evaluation of Improving Alignment and Robustness with Short Circuiting☆13Updated 5 months ago
- Code accompanying our paper "Feature Learning in Infinite-Width Neural Networks" (https://arxiv.org/abs/2011.14522)☆58Updated 3 years ago
- Privacy backdoors☆51Updated 8 months ago