UbiquitousLearning / Backpropagation_Free_Training_SurveyLinks
☆25Updated last year
Alternatives and similar repositories for Backpropagation_Free_Training_Survey
Users that are interested in Backpropagation_Free_Training_Survey are comparing it to the libraries listed below
Sorting:
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆111Updated 2 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆66Updated 11 months ago
- Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer☆19Updated 7 months ago
- A curated list of early exiting (LLM, CV, NLP, etc)☆62Updated last year
- Survey Paper List - Efficient LLM and Foundation Models☆257Updated last year
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆13Updated last year
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆36Updated last year
- A curated list of Model Merging methods.☆92Updated last year
- The official implementation of TinyTrain [ICML '24]☆22Updated last year
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆35Updated last year
- ☆27Updated last year
- a curated list of high-quality papers on resource-efficient LLMs 🌱☆139Updated 6 months ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆34Updated 2 years ago
- ☆35Updated last year
- Collections of paper reviews in SEELab, related to IoT/HD/ML etc.☆32Updated 4 months ago
- ☆58Updated 9 months ago
- Official [AAAI] Code Repository for "Continual Learning with Scaled Gradient Projection".☆14Updated 2 years ago
- Efficient LLM Inference Acceleration using Prompting☆50Updated 11 months ago
- Code for paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts"☆29Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆89Updated 10 months ago
- summer school materials☆46Updated 2 years ago
- Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs☆18Updated 9 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆34Updated 10 months ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆63Updated 6 months ago
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging☆69Updated 6 months ago
- ☆70Updated 9 months ago
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆22Updated 3 months ago
- ☆100Updated last year
- [TKDE'25] The official GitHub page for the survey paper "A Survey on Mixture of Experts in Large Language Models".☆420Updated 2 months ago
- AN EFFICIENT AND GENERAL FRAMEWORK FOR LAYERWISE-ADAPTIVE GRADIENT COMPRESSION☆14Updated last year