mansheej / lth_dietLinks

☆10

Alternatives and similar repositories for lth_diet

Users that are interested in lth_diet are comparing it to the libraries listed below

Sorting:

js-d / sim_metric
☆37Updated last year
ganguli-lab / degrees-of-freedom
☆37Updated 3 years ago
stanislavfort / dissect-git-re-basin
Replicating and dissecting the git-re-basin project in one-click-replication Colabs
☆36Updated 2 years ago
hlml / fortuitous_forgetting
☆19Updated 3 years ago
MadryLab / datamodels-data
Data for "Datamodels: Predicting Predictions with Training Data"
☆97Updated 2 years ago
MadryLab / datamodels
☆29Updated 2 years ago
EkdeepSLubana / MMC
Codebase for Mechanistic Mode Connectivity
☆15Updated 2 years ago
taufeeque9 / codebook-features
Sparse and discrete interpretability tool for neural networks
☆63Updated last year
borjanG / 2023-transformers
Codes for the paper The emergence of clusters in self-attention dynamics.
☆17Updated last year
ethancaballero / broken_neural_scaling_laws
Code Release for "Broken Neural Scaling Laws" (BNSL) paper
☆59Updated last year
sjunhongshen / DASH
☆23Updated 2 years ago
MadryLab / modeldiff
ModelDiff: A Framework for Comparing Learning Algorithms
☆59Updated last year
Ping-C / optimizer
This repository includes code to reproduce the tables in "Loss Landscapes are All You Need: Neural Network Generalization Can Be Explaine…
☆37Updated 2 years ago
ppope / dimensions
Code for "The Intrinsic Dimension of Images and Its Impact on Learning" - ICLR 2021 Spotlight https://openreview.net/forum?id=XJk19XzGq2J
☆69Updated last year
lawrennd / neurips2014
Notebooks for managing NeurIPS 2014 and analysing the NeurIPS experiment.
☆11Updated last year
Sea-Snell / grokking
unofficial re-implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆77Updated 3 years ago
mkhodak / relax
☆15Updated 3 years ago
JonasGeiping / dataaugs
☆18Updated 2 years ago
SamsungSAILMontreal / ghn3
Code for "Can We Scale Transformers to Predict Parameters of Diverse ImageNet Models?" [ICML 2023]
☆36Updated 10 months ago
JeanKaddour / NoTrainNoGain
Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)
☆80Updated last year
tml-epfl / sgd-sparse-features
SGD with large step sizes learns sparse features [ICML 2023]
☆32Updated 2 years ago
locuslab / edge-of-stability
☆70Updated 7 months ago
aks2203 / deep-thinking
A centralized place for deep thinking code and experiments
☆85Updated last year
google-research / jax-influence
☆60Updated 3 years ago
aks2203 / easy-to-hard-data
Pytorch Datasets for Easy-To-Hard
☆28Updated 6 months ago
aks2203 / easy-to-hard
Official repository for the paper "Can You Learn an Algorithm? Generalizing from Easy to Hard Problems with Recurrent Networks"
☆59Updated 3 years ago
facebookresearch / nbm-spam
Training and evaluating NBM and SPAM for interpretable machine learning.
☆78Updated 2 years ago
fattorib / Flax-ResNets
CIFAR10 ResNets implemented in JAX+Flax
☆12Updated 3 years ago
teddykoker / grokking
PyTorch implementation of "Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets"
☆37Updated 3 years ago
google-deepmind / conformal_training
This repository contains a Jax implementation of conformal training corresponding to the ICLR'22 paper "learning optimal conformal classi…
☆130Updated 2 years ago