devansh20la / LPF-SGD
☆17 Updated 3 years ago
Alternatives and similar repositories for LPF-SGD
Users who are interested in LPF-SGD are comparing it to the repositories listed below.
- ☆32 Updated 3 years ago
- ☆11 Updated 3 years ago
- ☆35 Updated 3 years ago
- [TPAMI 2023] Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces ☆42 Updated 3 years ago
- Transformers trained on Tiny ImageNet ☆58 Updated 3 months ago
- [AAAI-2022] Up to 100x Faster Data-free Knowledge Distillation ☆75 Updated 3 years ago
- PyTorch implementation for Vision Transformer [Dosovitskiy, A. (ICLR'21)] modified to obtain over 90% accuracy FROM SCRATCH on CIFAR-10 wit… ☆201 Updated last year
- [ICLR 2023] Trainable Weight Averaging: Efficient Training by Optimizing Historical Solutions ☆27 Updated 10 months ago
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023] ☆28 Updated 2 years ago
- Reproduce CKA: Similarity of Neural Network Representations Revisited ☆311 Updated 5 years ago
- A NumPy and PyTorch implementation of CKA similarity with CUDA support ☆94 Updated 4 years ago
- In progress. ☆67 Updated last year
- (PyTorch) Training ResNets on ImageNet-100 data ☆63 Updated 3 years ago
- [ICML 2021] "Do We Actually Need Dense Over-Parameterization? In-Time Over-Parameterization in Sparse Training" by Shiwei Liu, Lu Yin, De… ☆45 Updated 2 years ago
- Official PyTorch implementation of "Dataset Condensation via Efficient Synthetic-Data Parameterization" (ICML'22) ☆115 Updated 2 years ago
- [IJCAI'22 Survey] Recent Advances on Neural Network Pruning at Initialization. ☆59 Updated 2 years ago
- pytorch-tiny-imagenet ☆187 Updated this week
- ☆34 Updated last year
- Lightweight PyTorch implementation of RigL, a sparse-to-sparse optimizer. ☆60 Updated 4 years ago
- [NeurIPS 2021] Code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228 ☆19 Updated 3 years ago
- [NeurIPS 2021] Sparse Training via Boosting Pruning Plasticity with Neuroregeneration ☆31 Updated 2 years ago
- PyTorch implementation of Mixer-nano (#parameters is 0.67M, originally Mixer-S/16 has 18M) with 90.83% acc. on CIFAR-10. Training from s… ☆36 Updated 4 years ago
- [IJCAI-2021] Contrastive Model Inversion for Data-Free Knowledge Distillation ☆73 Updated 3 years ago
- Compare neural networks by their feature similarity ☆376 Updated 2 years ago
- ☆89 Updated 2 years ago
- Weight-Averaged Sharpness-Aware Minimization (NeurIPS 2022) ☆28 Updated 2 years ago
- Implementation of Effective Sparsification of Neural Networks with Global Sparsity Constraint ☆31 Updated 3 years ago
- Comparison of methods for pruning at initialization, prior to training (SynFlow/SNIP/GraSP), in PyTorch ☆17 Updated last year
- A PyTorch implementation of Centered Kernel Alignment (CKA) with GPU acceleration. ☆54 Updated last year
- ☆69 Updated 2 years ago