UbiquitousLearning / Backpropagation_Free_Training_SurveyLinks
☆25Updated last year
Alternatives and similar repositories for Backpropagation_Free_Training_Survey
Users that are interested in Backpropagation_Free_Training_Survey are comparing it to the libraries listed below
Sorting:
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆120Updated 5 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆67Updated last year
- Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer☆20Updated 10 months ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆38Updated last year
- A curated list of early exiting (LLM, CV, NLP, etc)☆69Updated last year
- A curated list of Model Merging methods.☆94Updated 2 weeks ago
- ☆35Updated last year
- Neural Tangent Kernel Papers☆120Updated 11 months ago
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆52Updated last year
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆63Updated 9 months ago
- Survey Paper List - Efficient LLM and Foundation Models☆258Updated last year
- [TPAMI 2023] Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces☆42Updated 3 years ago
- Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)☆13Updated last year
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆37Updated last year
- The official implementation of TinyTrain [ICML '24]☆23Updated last year
- [NeurIPS 2021] code for "Taxonomizing local versus global structure in neural network loss landscapes" https://arxiv.org/abs/2107.11228☆19Updated 3 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆382Updated last week
- Code for CVPR paper: Computationally Budgeted Continual Learning: What Does Matter?☆18Updated last year
- AN EFFICIENT AND GENERAL FRAMEWORK FOR LAYERWISE-ADAPTIVE GRADIENT COMPRESSION☆14Updated 2 years ago
- ☆14Updated last year
- The official implement of paper "Does Federated Learning Really Need Backpropagation?"☆23Updated 2 years ago
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆24Updated 6 months ago
- ☆73Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆97Updated last year
- Welcome to the 'In Context Learning Theory' Reading Group☆30Updated last year
- ☆35Updated 3 years ago
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆36Updated last year
- Official Implementation of LANTERN (ICLR'25) and LANTERN++(ICLRW-SCOPE'25)☆18Updated 9 months ago
- Pytorch code for experiments on Linear Transformers☆24Updated last year
- Selective Aggregation for Low-Rank Adaptation in Federated Learning [ICLR 2025]☆52Updated 8 months ago