Raiden-Zhu / ICML-2023-DSGD-and-SAM
[ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent
☆16Updated 11 months ago
Related projects ⓘ
Alternatives and complementary repositories for ICML-2023-DSGD-and-SAM
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆24Updated 2 weeks ago
- This is the official implementation of the ICML 2023 paper - Can Forward Gradient Match Backpropagation ?☆10Updated last year
- The official code for ICDM2023 paper: ' FedDIP: Federated Learning with Extreme Dynamic Pruning and Incremental Regularization'☆9Updated 3 months ago
- ☆12Updated 2 years ago
- A simple Jax implementation of influence functions.☆16Updated 7 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆43Updated last month
- Repo to reproduce results for Where to Begin? On the Impact of Pre-Training and Initialization in Federated Learning☆25Updated last year
- Code related to ’Beyond spectral gap: The role of the topology in decentralized learning‘.☆11Updated 2 years ago
- ☆10Updated 2 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆30Updated 8 months ago
- ☆17Updated last year
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆11Updated 3 months ago
- ☆38Updated 3 months ago
- This is an official repository for "Performance Scaling via Optimal Transport: Enabling Data Selection from Partially Revealed Sources" (…☆11Updated last year
- Official Implementation of the CVPR'23 paper 'Regularization of polynomial networks for image recognition'.☆9Updated last year
- summer school materials☆45Updated last year
- (ICML 2023) Feature learning in deep classifiers through Intermediate Neural Collapse: Accompanying code☆13Updated last year
- Code for the paper "Efficient Dataset Distillation using Random Feature Approximation"☆36Updated last year
- Continual learning of task-specific approximations of the parameter posterior distribution via a shared hypernetwork.☆16Updated 2 weeks ago
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆18Updated 2 years ago
- Private Adaptive Optimization with Side Information (ICML '22)☆16Updated 2 years ago
- [ICLR2023] NTK-SAP: Improving neural network pruning by aligning training dynamics☆18Updated last year
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆73Updated 4 months ago
- ☆26Updated last year
- Python implementation of Scaling Neural Tangent Kernels via Sketching and Random Features☆14Updated 2 years ago
- [ICML2022] Training Your Sparse Neural Network Better with Any Mask. Ajay Jaiswal, Haoyu Ma, Tianlong Chen, ying Ding, and Zhangyang Wang☆26Updated 2 years ago
- Prospect Pruning: Finding Trainable Weights at Initialization Using Meta-Gradients☆30Updated 2 years ago
- ☆9Updated last year
- Official PyTorch Implementation for Continual Learning and Private Unlearning☆13Updated 2 years ago
- gradient norm penalty☆38Updated 5 months ago