UbiquitousLearning / Backpropagation_Free_Training_SurveyLinks
☆25Updated last year
Alternatives and similar repositories for Backpropagation_Free_Training_Survey
Users that are interested in Backpropagation_Free_Training_Survey are comparing it to the libraries listed below
Sorting:
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆124Updated 6 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆70Updated last year
- Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer☆22Updated 11 months ago
- A curated list of early exiting (LLM, CV, NLP, etc)☆70Updated last year
- Neural Tangent Kernel Papers☆121Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆39Updated last year
- Official [AAAI] Code Repository for "Continual Learning with Scaled Gradient Projection".☆15Updated 2 years ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆19Updated 2 years ago
- Awesome-Low-Rank-Adaptation☆127Updated last year
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆13Updated last year
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆63Updated 10 months ago
- The official implementation of TinyTrain [ICML '24]☆23Updated last year
- summer school materials☆46Updated 2 years ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆37Updated last year
- Survey Paper List - Efficient LLM and Foundation Models☆260Updated last year
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆36Updated last year
- ☆73Updated last year
- A curated list of Model Merging methods.☆96Updated last month
- ☆13Updated last year
- Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models☆27Updated 9 months ago
- Code for CVPR paper: Computationally Budgeted Continual Learning: What Does Matter?☆18Updated last year
- ☆35Updated 3 years ago
- [ICML 2023] Decentralized SGD and Average-direction SAM are Asymptotically Equivalent☆20Updated 2 years ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆100Updated last year
- SparCL: Sparse Continual Learning on the Edge @ NeurIPS 22☆30Updated 2 years ago
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆25Updated 11 months ago
- AN EFFICIENT AND GENERAL FRAMEWORK FOR LAYERWISE-ADAPTIVE GRADIENT COMPRESSION☆14Updated 2 years ago
- [DMLR 2024] FedAIoT: A Federated Learning Benchmark for Artificial Intelligence of Things☆59Updated last year
- ☆35Updated last year
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆30Updated last year