Training vision models with full-batch gradient descent and regularization
☆39 · Feb 14, 2023 · Updated 3 years ago
Alternatives and similar repositories for fullbatchtraining
Users that are interested in fullbatchtraining are comparing it to the libraries listed below
- ☆18 · Oct 12, 2022 · Updated 3 years ago
- Algorithms for approximate attention in LLMs ☆21 · Apr 14, 2025 · Updated 11 months ago
- Generating Potent Poisons and Backdoors from Scratch with Guided Diffusion ☆11 · Apr 1, 2024 · Updated last year
- A simple and efficient baseline for data attribution ☆11 · Nov 10, 2023 · Updated 2 years ago
- Code for the paper "Understanding Generalization through Visualizations" ☆65 · Jan 15, 2021 · Updated 5 years ago
- Pytorch Datasets for Easy-To-Hard ☆29 · Jan 9, 2025 · Updated last year
- ☆13 · Mar 22, 2023 · Updated 2 years ago
- Pytorch ImageNet1k Loader with Bounding Boxes. ☆13 · Jan 23, 2022 · Updated 4 years ago
- Gemstones: A Model Suite for Multi-Faceted Scaling Laws (NeurIPS 2025) ☆33 · Sep 28, 2025 · Updated 5 months ago
- Implementation of experiments from The No Free Lunch Theorem, Kolmogorov Complexity, and the Role of Inductive Biases in Machine Learning ☆17 · May 14, 2023 · Updated 2 years ago
- ☆26 · Dec 14, 2021 · Updated 4 years ago
- ☆16 · Jul 17, 2022 · Updated 3 years ago
- Towards Understanding Sharpness-Aware Minimization [ICML 2022] ☆38 · Jun 14, 2022 · Updated 3 years ago
- An empirical investigation of deep learning theory ☆16 · Oct 3, 2019 · Updated 6 years ago
- ☆33 · Nov 27, 2023 · Updated 2 years ago
- Official implementation of GOAT model (ICML2023) ☆38 · Jul 3, 2023 · Updated 2 years ago
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022) ☆28 · Jan 13, 2023 · Updated 3 years ago
- ☆15 · Oct 18, 2024 · Updated last year
- ☆24 · Jan 27, 2022 · Updated 4 years ago
- ☆34 · Jan 25, 2024 · Updated 2 years ago
- Analyze the dynamic stability of SGD ☆13 · Nov 25, 2018 · Updated 7 years ago
- Code for computing the hidden biases in deep networks and its applications ☆14 · Feb 23, 2023 · Updated 3 years ago
- Why Do We Need Weight Decay in Modern Deep Learning? [NeurIPS 2024] ☆71 · Sep 25, 2024 · Updated last year
- Cross-library augmentation toolbox supporting 300 operators over 8 libraries + AI transforms ☆12 · Jan 11, 2022 · Updated 4 years ago
- What do we learn from inverting CLIP models? ☆58 · Mar 6, 2024 · Updated 2 years ago
- Code for Active Learning at The ImageNet Scale. This repository implements many popular active learning algorithms and allows training wi… ☆54 · Nov 29, 2021 · Updated 4 years ago
- [CVPR 2024] This repository includes the official implementation of our paper "Revisiting Adversarial Training at Scale" ☆20 · Apr 21, 2024 · Updated last year
- ☆11 · Apr 23, 2021 · Updated 4 years ago
- Code and results accompanying our paper titled Leveraging Unlabeled Data to Predict Out-of-Distribution Performance at ICLR 2022 ☆10 · Dec 8, 2022 · Updated 3 years ago
- A suite of communication proxies for HPC applications ☆13 · Jul 7, 2023 · Updated 2 years ago
- ☆23 · Oct 5, 2023 · Updated 2 years ago
- ☆12 · Oct 20, 2023 · Updated 2 years ago
- (TG'2023) Official code for the paper "Revisiting of AlphaStar" (previously called "Rethinking of AlphaStar"). It compares the raw interf… ☆10 · Sep 6, 2021 · Updated 4 years ago
- Code for the paper "Robustness Certificates for Sparse Adversarial Attacks by Randomized Ablation" by Alexander Levine and Soheil Feizi. ☆10 · Aug 22, 2022 · Updated 3 years ago
- ☆19 · Jun 10, 2024 · Updated last year
- Uncertainty-Guided Pseudo-Labelling with Model Averaging ☆11 · Updated this week
- ☆57 · Feb 13, 2023 · Updated 3 years ago
- ☆11 · Jan 2, 2026 · Updated 2 months ago
- Official repo for Detecting, Explaining, and Mitigating Memorization in Diffusion Models (ICLR 2024) ☆78 · Apr 3, 2024 · Updated last year