Rose-STL-Lab / Teleportation-Optimization
[ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries
☆30 · Updated last year
Alternatives and similar repositories for Teleportation-Optimization
Users who are interested in Teleportation-Optimization are comparing it to the libraries listed below:
- This repository is the implementation of the paper Training Free Pretrained Model Merging (CVPR2024). ☆28 · Updated last year
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation ☆44 · Updated last year
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning" (NeurIPS 2022 Outstanding Paper Award) ☆56 · Updated 2 years ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129) ☆90 · Updated 2 years ago
- ☆45 · Updated 2 years ago
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023. ☆33 · Updated last year
- Repository containing code for blockwise SSL training ☆29 · Updated 7 months ago
- This repository includes the official implementation of our paper "Scaling White-Box Transformers for Vision" ☆48 · Updated last year
- Sharpness-Aware Minimization Leads to Low-Rank Features [NeurIPS 2023] ☆28 · Updated last year
- Official PyTorch Code for "Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?" (https://arxiv.org/abs/2305.12954) ☆47 · Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original … ☆128 · Updated 6 months ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023) ☆39 · Updated 2 years ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers ☆25 · Updated 3 months ago
- [NeurIPS'24] Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization ☆32 · Updated 8 months ago
- Official implementation for Equivariant Architectures for Learning in Deep Weight Spaces [ICML 2023] ☆89 · Updated last year
- [NeurIPS 2024] VeLoRA : Memory Efficient Training using Rank-1 Sub-Token Projections ☆20 · Updated 7 months ago
- ☆58 · Updated 2 years ago
- A curated list of Model Merging methods. ☆92 · Updated 8 months ago
- Code for ICML2023 paper, DDGR: Continual Learning with Deep Diffusion-based Generative Replay. ☆37 · Updated last year
- ☆38 · Updated last year
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders ☆58 · Updated last year
- ☆43 · Updated last year
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers" ☆105 · Updated last year
- Data distillation benchmark ☆64 · Updated this week
- ☆35 · Updated last year
- ☆109 · Updated last year
- ☆35 · Updated 2 years ago
- ICLR 2024, Towards Lossless Dataset Distillation via Difficulty-Aligned Trajectory Matching ☆101 · Updated last year
- [ICLR 2022] "Anti-Oversmoothing in Deep Vision Transformers via the Fourier Domain Analysis: From Theory to Practice" by Peihao Wang, Wen… ☆79 · Updated last year
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts" ☆57 · Updated last year