Training vision models with full-batch gradient descent and regularization
☆39Feb 14, 2023Updated 3 years ago
Alternatives and similar repositories for fullbatchtraining
Users who are interested in fullbatchtraining are comparing it to the libraries listed below
- ☆18Oct 12, 2022Updated 3 years ago
- Code for the paper "Understanding Generalization through Visualizations"☆65Jan 15, 2021Updated 5 years ago
- Algorithms for approximate attention in LLMs☆21Apr 14, 2025Updated 10 months ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022]☆38Jun 14, 2022Updated 3 years ago
- (TG'2023) Official code for the paper "Revisiting of AlphaStar" (previously called "Rethinking of AlphaStar"). It compares the raw interf…☆10Sep 6, 2021Updated 4 years ago
- PyTorch Datasets for Easy-To-Hard☆29Jan 9, 2025Updated last year
- ☆26Dec 14, 2021Updated 4 years ago
- Uncertainty-Guided Pseudo-Labelling with Model Averaging☆11Sep 24, 2024Updated last year
- Code for the paper "Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation" by Alexander Levine and Soheil Feizi.☆10Aug 22, 2022Updated 3 years ago
- Code for computing the hidden biases in deep networks and its applications☆14Feb 23, 2023Updated 3 years ago
- Code and results accompanying our paper titled Leveraging Unlabeled Data to Predict Out-of-Distribution Performance at ICLR 2022☆10Dec 8, 2022Updated 3 years ago
- PyTorch ImageNet1k Loader with Bounding Boxes.☆13Jan 23, 2022Updated 4 years ago
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022)☆28Jan 13, 2023Updated 3 years ago
- ☆34Jan 25, 2024Updated 2 years ago
- ☆11Apr 23, 2021Updated 4 years ago
- ☆16Dec 9, 2023Updated 2 years ago
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025)☆33Sep 28, 2025Updated 5 months ago
- ☆33Nov 27, 2023Updated 2 years ago
- Cross-library augmentation toolbox supporting 300 operators over 8 libraries + AI transforms☆12Jan 11, 2022Updated 4 years ago
- A suite of communication proxies for HPC applications☆13Jul 7, 2023Updated 2 years ago
- CVPR 2019 paper "Disentangling Adversarial Robustness and Generalization".☆14Oct 28, 2019Updated 6 years ago
- ☆19Jun 10, 2024Updated last year
- An empirical investigation of deep learning theory☆16Oct 3, 2019Updated 6 years ago
- Analyze the dynamic stability of SGD☆13Nov 25, 2018Updated 7 years ago
- Official implementation of GOAT model (ICML2023)☆38Jul 3, 2023Updated 2 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024]☆70Sep 25, 2024Updated last year
- ☆16Jul 17, 2022Updated 3 years ago
- The official code to reproduce results from the NAACL 2019 paper: White-to-Black: Efficient Distillation of Black-Box Adversarial Attacks☆12Jun 4, 2019Updated 6 years ago
- Implementation of experiments from The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning☆17May 14, 2023Updated 2 years ago
- Robustness for Non-Parametric Classification: A Generic Attack and Defense☆18Nov 21, 2022Updated 3 years ago
- A modern look at the relationship between sharpness and generalization [ICML 2023]☆43Sep 11, 2023Updated 2 years ago
- The Pitfalls of Simplicity Bias in Neural Networks [NeurIPS 2020] (http://arxiv.org/abs/2006.07710v2)☆42Jan 21, 2024Updated 2 years ago
- [CVPR 2024] This repository includes the official implementation of our paper "Revisiting Adversarial Training at Scale"☆20Apr 21, 2024Updated last year
- The official code for the publication: "The Close Relationship Between Contrastive Learning and Meta-Learning".☆18Sep 19, 2022Updated 3 years ago
- Proximal Backpropagation - a neural network training algorithm that takes implicit instead of explicit gradient steps☆42Mar 17, 2019Updated 6 years ago
- ☆24Jan 27, 2022Updated 4 years ago
- ☆23Oct 5, 2023Updated 2 years ago
- [NeurIPS 2022] Source code for our paper "Escaping Saddle Points for Effective Generalization on Class-Imbalanced Data"☆24Oct 16, 2023Updated 2 years ago
- Code for Active Learning at The ImageNet Scale. This repository implements many popular active learning algorithms and allows training wi…☆54Nov 29, 2021Updated 4 years ago