UbiquitousLearning / Backpropagation_Free_Training_SurveyLinks

☆25

Alternatives and similar repositories for Backpropagation_Free_Training_Survey

Users that are interested in Backpropagation_Free_Training_Survey are comparing it to the libraries listed below

Sorting:

Yanjun-Zhao / HiZOO
Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer
☆20Updated 9 months ago
ZO-Bench / ZO-LLM
[ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".
☆117Updated 4 months ago
OPTML-Group / DeepZero
[ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…
☆65Updated last year
falcon-xu / early-exit-papers
A curated list of early exiting (LLM, CV, NLP, etc)
☆68Updated last year
Jiacheng-Zhu-AIML / AsymmetryLoRA
Preprint: Asymmetry in Low-Rank Adapters of Foundation Models
☆37Updated last year
LGrCo / L-GreCo
AN EFFICIENT AND GENERAL FRAMEWORK FOR LAYERWISE-ADAPTIVE GRADIENT COMPRESSION
☆14Updated 2 years ago
UbiquitousLearning / FwdLLM
☆35Updated last year
osehmathias / lisa
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
☆36Updated last year
theyoungkwon / TinyTrain
The official implementation of TinyTrain [ICML '24]
☆23Updated last year
andyjm3 / SLTrain
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)
☆36Updated last year
AngusDujw / SAF
☆35Updated 2 years ago
UbiquitousLearning / Efficient_Foundation_Model_Survey
Survey Paper List - Efficient LLM and Foundation Models
☆257Updated last year
didizhu-judy / Model-Tailor
[ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models
☆32Updated last year
HazyResearch / fly
☆221Updated 2 years ago
yifanycc / AdaZeta
[EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…
☆12Updated 11 months ago
zyushun / hessian-spectrum
Code for the paper: Why Transformers Need Adam: A Hessian Perspective
☆63Updated 8 months ago
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆96Updated last year
imagination-research / EEP
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
☆21Updated 2 weeks ago
vectozavr / llm-hessian
Using PyTorch autograd to compute Hessian of Perplexity for Large Language Models
☆26Updated 7 months ago
nblt / DLDR
[TPAMI 2023] Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces
☆42Updated 3 years ago
reds-lab / LAVA
This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).
☆51Updated last year
cjyaras / deep-lora-transformers
Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)
☆13Updated last year
hmarkc / parallel-prompt-decoding
Efficient LLM Inference Acceleration using Prompting
☆51Updated last year
Pengxin-Guo / FedSA-LoRA
Selective Aggregation for Low-Rank Adaptation in Federated Learning [ICLR 2025]
☆51Updated 7 months ago
NJUDeepEngine / meteora
This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".
☆24Updated 6 months ago
ml4ai / fed-recon
Federated reconnaissance mini-ImageNet benchmark and baseline models
☆13Updated 4 years ago
hanshen95 / penalized-bilevel-gradient-descent
An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.
☆19Updated 2 years ago
ziyaow1010 / FederatedLLM
☆99Updated 10 months ago
FengHZ / BAFFLE
The official implement of paper "Does Federated Learning Really Need Backpropagation?"
☆23Updated 2 years ago
UbiquitousLearning / Paper-list-resource-efficient-large-language-model
☆101Updated last year