UbiquitousLearning / Backpropagation_Free_Training_Survey
☆23Updated last year
Alternatives and similar repositories for Backpropagation_Free_Training_Survey
Users who are interested in Backpropagation_Free_Training_Survey are comparing it to the repositories listed below
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark".☆105Updated 2 weeks ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆62Updated 9 months ago
- Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer☆16Updated 5 months ago
- A curated list of early exiting (LLM, CV, NLP, etc)☆57Updated 11 months ago
- The official implementation of TinyTrain [ICML '24]☆22Updated last year
- A curated list of Model Merging methods.☆92Updated 10 months ago
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆87Updated 8 months ago
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆31Updated last year
- ☆70Updated 7 months ago
- ☆36Updated 2 years ago
- Source code of "Task arithmetic in the tangent space: Improved editing of pre-trained models".☆102Updated 2 years ago
- Neural Tangent Kernel Papers☆115Updated 6 months ago
- Survey Paper List - Efficient LLM and Foundation Models☆252Updated 9 months ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆59Updated 4 months ago
- ☆33Updated last year
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆48Updated last year
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆27Updated last year
- ☆54Updated 7 months ago
- An Efficient and General Framework for Layerwise-Adaptive Gradient Compression☆15Updated last year
- summer school materials☆44Updated last year
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆59Updated 4 months ago
- [NeurIPS'2021] "MEST: Accurate and Fast Memory-Economic Sparse Training Framework on the Edge", Geng Yuan, Xiaolong Ma, Yanzhi Wang et al…☆18Updated 3 years ago
- Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation☆23Updated 5 months ago
- [NeurIPS 2023 Spotlight] Temperature Balancing, Layer-wise Weight Analysis, and Neural Network Training☆35Updated 3 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆32Updated 8 months ago
- ☆56Updated 7 months ago
- Official PyTorch implementation of "DiffusionNAG: Predictor-guided Neural Architecture Generation with Diffusion Models" (ICLR 2024)☆44Updated last year
- A NumPy and PyTorch implementation of CKA-similarity with CUDA support☆91Updated 4 years ago
- Federated reconnaissance mini-ImageNet benchmark and baseline models☆13Updated 3 years ago