JonasGeiping / fullbatchtrainingLinks

Training vision models with full-batch gradient descent and regularization

☆39

Alternatives and similar repositories for fullbatchtraining

Users that are interested in fullbatchtraining are comparing it to the libraries listed below

Sorting:

MadryLab / BREEDS-Benchmarks
☆55Updated 5 years ago
wronnyhuang / gen-viz
Code for the paper "Understanding Generalization through Visualizations"
☆64Updated 4 years ago
JeanKaddour / WASAM
Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022)
☆28Updated 2 years ago
tml-epfl / understanding-sam
Towards Understanding Sharpness-Aware Minimization [ICML 2022]
☆36Updated 3 years ago
mueller-mp / SAM-ON
☆34Updated last year
tml-epfl / sharpness-vs-generalization
A modern look at the relationship between sharpness and generalization [ICML 2023]
☆43Updated 2 years ago
singlasahil14 / salient_imagenet
Code for the ICLR 2022 paper. Salient Imagenet: How to discover spurious features in deep learning?
☆40Updated 3 years ago
ssagawa / overparam_spur_corr
An Investigation of Why Overparameterization Exacerbates Spurious Correlations
☆30Updated 5 years ago
yangarbiter / robust-local-lipschitz
A Closer Look at Accuracy vs. Robustness
☆88Updated 4 years ago
deep-lab / DeepnetHessian
Measurements of Three-Level Hierarchical Structure in the Outliers in the Spectrum of Deepnet Hessians (ICML 2019)
☆16Updated 6 years ago
modestyachts / imagenet-testbed
ImageNet Testbed, associated with the paper "Measuring Robustness to Natural Distribution Shifts in Image Classification."
☆119Updated 2 years ago
locuslab / robust_union
[ICML'20] Multi Steepest Descent (MSD) for robustness against the union of multiple perturbation models.
☆26Updated last year
dydjw9 / Efficient_SAM
☆58Updated 2 years ago
bneyshabur / generalization-bounds
Computing various measures and generalization bounds on convolutional and fully connected networks
☆35Updated 6 years ago
facebookresearch / BalancingGroups
Simple data balancing baselines for worst-group-accuracy benchmarks.
☆43Updated 2 years ago
harshays / simplicitybiaspitfalls
The Pitfalls of Simplicity Bias in Neural Networks [NeurIPS 2020] (http://arxiv.org/abs/2006.07710v2)
☆42Updated last year
liuchen11 / AdversaryLossLandscape
On the Loss Landscape of Adversarial Training: Identifying Challenges and How to Overcome Them [NeurIPS 2020]
☆36Updated 4 years ago
PolinaKirichenko / deep_feature_reweighting
☆109Updated 2 years ago
harshays / inputgradients
Do input gradients highlight discriminative features? [NeurIPS 2021] (https://arxiv.org/abs/2102.12781)
☆13Updated 2 years ago
mpezeshki / Gradient_Starvation
Gradient Starvation: A Learning Proclivity in Neural Networks
☆61Updated 4 years ago
davidstutz / confidence-calibrated-adversarial-training
Implementation of Confidence-Calibrated Adversarial Training (CCAT).
☆45Updated 5 years ago
ryoungj / optdom
[ICLR'22] Self-supervised learning optimally robust representations for domain shift.
☆24Updated 3 years ago
MadryLab / DebuggableDeepNetworks
☆38Updated 4 years ago
locuslab / projected_sinkhorn
☆88Updated last year
abietti / kernel_reg
Pytorch implementation of regularization methods for deep networks obtained via kernel methods.
☆22Updated 5 years ago
anniesch / jtt
Code for "Just Train Twice: Improving Group Robustness without Training Group Information"
☆72Updated last year
tding1 / Neural-Collapse
[NeurIPS 2021] A Geometric Analysis of Neural Collapse with Unconstrained Features
☆59Updated 3 years ago
hushon / JAX-ResNet-CIFAR10
Simple CIFAR10 ResNet example with JAX.
☆23Updated 4 years ago
avirmaux / lipEstimation
☆59Updated 2 years ago
yoonholee / DivDis
☆39Updated 3 years ago