Rose-STL-Lab / Teleportation-Optimization
[ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries
☆29Updated 10 months ago
Alternatives and similar repositories for Teleportation-Optimization:
Users that are interested in Teleportation-Optimization are comparing it to the libraries listed below
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023]☆28Updated last year
- This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"☆48Updated 10 months ago
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆40Updated 2 years ago
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)☆56Updated 2 years ago
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024).☆29Updated last year
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆90Updated 2 years ago
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆36Updated last year
- ☆42Updated last year
- Switch EMA: A Free Lunch for Better Flatness and Sharpness☆26Updated last year
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023]☆89Updated last year
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections☆20Updated 6 months ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆53Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆128Updated 5 months ago
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen…☆80Updated last year
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"☆104Updated last year
- [CVPR 2024] Friendly Sharpness-Aware Minimization☆33Updated 5 months ago
- ☆34Updated last year
- ☆35Updated 2 years ago
- ☆52Updated 2 years ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆82Updated last year
- Data distillation benchmark☆58Updated last week
- [Preprint] GMem: A Modular Approach for Ultra-Efficient Generative Models☆33Updated last month
- ☆47Updated last year
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆58Updated last year
- [ICLR'23] Trainability Preserving Neural Pruning (PyTorch)☆33Updated last year
- [NeurIPS 2023] Code release for "Going Beyond Linear Mode Connectivity: The Layerwise Linear Feature Connectivity"☆18Updated last year
- Official Implementation for PlugIn Inversion☆16Updated 3 years ago
- ☆21Updated 2 years ago
- Repository containing code for blockwise SSL training☆29Updated 6 months ago
- Official PyTorch implementation of "Meta-prediction Model for Distillation-Aware NAS on Unseen Datasets" (ICLR 2023 notable top 25%)☆24Updated last year