devansh20la / LPF-SGD
☆17Updated 2 years ago
Alternatives and similar repositories for LPF-SGD:
Users that are interested in LPF-SGD are comparing it to the libraries listed below
- ☆11Updated 2 years ago
- ☆35Updated 2 years ago
- [ICLR 2023] Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions☆27Updated 2 months ago
- ☆34Updated last year
- [TPAMI 2023] Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces☆40Updated 2 years ago
- (Pytorch) Training ResNets on ImageNet-100 data☆57Updated 3 years ago
- ☆30Updated 3 years ago
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022)☆28Updated 2 years ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆28Updated last year
- An Numpy and PyTorch Implementation of CKA-similarity with CUDA support☆90Updated 3 years ago
- Transformers trained on Tiny ImageNet☆54Updated 2 years ago
- ☆86Updated 2 years ago
- ☆18Updated last year
- Official [ICLR] Code Repository for "Gradient Projection Memory for Continual Learning"☆90Updated 3 years ago
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De…☆45Updated last year
- Implementation of Effective Sparsification of Neural Networks with Global Sparsity Constraint☆31Updated 3 years ago
- Code for the paper "A Light Recipe to Train Robust Vision Transformers" [SaTML 2023]☆52Updated 2 years ago
- ☆26Updated 3 years ago
- Lightweight torch implementation of rigl, a sparse-to-sparse optimizer.☆56Updated 3 years ago
- Official PyTorch implementation of "Dataset Condensation via Efficient Synthetic-Data Parameterization" (ICML'22)☆112Updated last year
- Comparison of method "Pruning at initialization prior to training" (Synflow/SNIP/GraSP) in PyTorch☆16Updated 11 months ago
- ☆63Updated last year
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆57Updated 7 months ago
- ☆57Updated 2 years ago
- [ICLR 2022] Training L_inf-dist-net with faster acceleration and better training strategies☆22Updated 3 years ago
- Git Re-Basin: Merging Models modulo Permutation Symmetries in PyTorch☆75Updated 2 years ago
- ☆38Updated 6 months ago
- PyTorch implementation for Vision Transformer[Dosovitskiy, A.(ICLR'21)] modified to obtain over 90% accuracy FROM SCRATCH on CIFAR-10 wit…☆191Updated last year
- ☆39Updated 2 years ago
- [Neurips 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration☆31Updated 2 years ago