Rose-STL-Lab / Teleportation-Optimization
[ICLR 2024] Improving Convergence and Generalization Using Parameter Symmetries
☆26Updated 3 months ago
Related projects: ⓘ
- Metrics for "Beyond neural scaling laws: beating power law scaling via data pruning " (NeurIPS 2022 Outstanding Paper Award)☆51Updated last year
- [Preprint] Why is the State of Neural Network Pruning so Confusing? On the Fairness, Comparison Setup, and Trainability in Network Prunin…☆40Updated last year
- [NeurIPS 2022] Make Sharpness-Aware Minimization Stronger: A Sparsified Perturbation Approach -- Official Implementation☆40Updated last year
- (NeurIPS 2023 spotlight) Large-scale Dataset Distillation/Condensation, 50 IPC (Images Per Class) achieves the highest 60.8% on original …☆112Updated 4 months ago
- Official PyTorch Code for "Is Synthetic Data From Diffusion Models Ready for Knowledge Distillation?" (https://arxiv.org/abs/2305.12954)☆43Updated 9 months ago
- Official Code for NeurIPS 2022 Paper: How Mask Matters: Towards Theoretical Understandings of Masked Autoencoders☆53Updated 10 months ago
- Official implementation for paper "Knowledge Diffusion for Distillation", NeurIPS 2023☆72Updated 7 months ago
- ☆37Updated 7 months ago
- (CVPR 2022) Automated Progressive Learning for Efficient Training of Vision Transformers☆24Updated 2 years ago
- [NeurIPS'22] What Makes a "Good" Data Augmentation in Knowledge Distillation -- A Statistical Perspective☆35Updated last year
- ☆49Updated 11 months ago
- Code release for Deep Incubation (https://arxiv.org/abs/2212.04129)☆90Updated last year
- Official Implementation for PlugIn Inversion☆15Updated 2 years ago
- [ICML2023] Revisiting Data-Free Knowledge Distillation with Poisoned Teachers☆22Updated 2 months ago
- Official code for the paper "Image generation with shortest path diffusion" accepted at ICML 2023.☆20Updated last year
- This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"☆44Updated 3 months ago
- ☆39Updated last year
- ☆52Updated last year
- Repository containing code for blockwise SSL training☆27Updated last year
- This is a PyTorch implementation of the paperViP A Differentially Private Foundation Model for Computer Vision☆38Updated last year
- Towards Unified and Effective Domain Generalization☆28Updated 9 months ago
- PyTorch implementation of "From Sparse to Soft Mixtures of Experts"☆38Updated last year
- PyTorch implementation of paper "Dataset Distillation via Factorization" in NeurIPS 2022.☆61Updated last year
- Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.☆32Updated last year
- Repo for the paper "Extrapolating from a Single Image to a Thousand Classes using Distillation"☆37Updated 2 months ago
- The code of the paper "Minimizing the Accumulated Trajectory Error to Improve Dataset Distillation" (CVPR2023)☆38Updated last year
- Official implementation of AAAI 2023 paper "Parameter-efficient Model Adaptation for Vision Transformers"☆94Updated last year
- ☆98Updated 6 months ago
- Variance Covariance Regularization☆14Updated last year
- Respect to the input tensor instead of paramters of NN☆15Updated 2 years ago