UbiquitousLearning / Backpropagation_Free_Training_SurveyLinks
☆25Updated last year
Alternatives and similar repositories for Backpropagation_Free_Training_Survey
Users that are interested in Backpropagation_Free_Training_Survey are comparing it to the libraries listed below
Sorting:
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆122Updated 6 months ago
- Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer☆20Updated 11 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆69Updated last year
- A curated list of early exiting (LLM, CV, NLP, etc)☆69Updated last year
- [ICML 2024] Model Tailor: Mitigating Catastrophic Forgetting in Multi-modal Large Language Models☆35Updated last year
- summer school materials☆46Updated 2 years ago
- The official implementation of TinyTrain [ICML '24]☆23Updated last year
- A curated list of Model Merging methods.☆94Updated last month
- ☆14Updated last year
- Survey Paper List - Efficient LLM and Foundation Models☆259Updated last year
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆38Updated last year
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning☆36Updated last year
- Code for CVPR paper: Computationally Budgeted Continual Learning: What Does Matter?☆18Updated last year
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆24Updated 7 months ago
- This is a curated list for Information Bottleneck Principle, in memory of Professor Naftali Tishby.☆383Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆98Updated last year
- AN EFFICIENT AND GENERAL FRAMEWORK FOR LAYERWISE-ADAPTIVE GRADIENT COMPRESSION☆14Updated 2 years ago
- "Efficient Federated Learning for Modern NLP", to appear at MobiCom 2023.☆34Updated 2 years ago
- A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning. TPAMI, 2024.☆345Updated this week
- ☆17Updated 3 years ago
- ☆35Updated 3 years ago
- [TPAMI 2023] Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces☆43Updated 3 years ago
- Official [AAAI] Code Repository for "Continual Learning with Scaled Gradient Projection".☆15Updated 2 years ago
- ☆35Updated last year
- An Numpy and PyTorch Implementation of CKA-similarity with CUDA support☆94Updated 4 years ago
- EE-LLM is a framework for large-scale training and inference of early-exit (EE) large language models (LLMs).☆73Updated last year
- Awesome-Low-Rank-Adaptation☆126Updated last year
- A collection of research papers on low-precision training methods☆57Updated 8 months ago
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆37Updated last year
- Decentralized SGD and Consensus with Communication Compression: https://arxiv.org/abs/1907.09356☆74Updated 5 years ago