UbiquitousLearning / Backpropagation_Free_Training_SurveyLinks
☆25Updated last year
Alternatives and similar repositories for Backpropagation_Free_Training_Survey
Users that are interested in Backpropagation_Free_Training_Survey are comparing it to the libraries listed below
Sorting:
- [ICML‘24] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".☆113Updated 4 months ago
- Second-Order Fine-Tuning without Pain for LLMs: a Hessian Informed Zeroth-Order Optimizer☆20Updated 8 months ago
- [ICLR'24] "DeepZero: Scaling up Zeroth-Order Optimization for Deep Model Training" by Aochuan Chen*, Yimeng Zhang*, Jinghan Jia, James Di…☆65Updated last year
- A curated list of early exiting (LLM, CV, NLP, etc)☆67Updated last year
- This is an official repository for "LAVA: Data Valuation without Pre-Specified Learning Algorithms" (ICLR2023).☆51Updated last year
- The official implementation of TinyTrain [ICML '24]☆22Updated last year
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models☆35Updated last year
- ☆17Updated 2 years ago
- ☆36Updated 2 years ago
- ☆19Updated last year
- Neural Tangent Kernel Papers☆119Updated 9 months ago
- Code for the paper: Why Transformers Need Adam: A Hessian Perspective☆65Updated 7 months ago
- An Numpy and PyTorch Implementation of CKA-similarity with CUDA support☆95Updated 4 years ago
- ☆35Updated last year
- ☆71Updated 11 months ago
- PyTorch implementation for Vision Transformer[Dosovitskiy, A.(ICLR'21)] modified to obtain over 90% accuracy FROM SCRATCH on CIFAR-10 wit…☆201Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.☆94Updated last year
- Official [AAAI] Code Repository for "Continual Learning with Scaled Gradient Projection".☆14Updated 2 years ago
- ☆30Updated last year
- Code repo for the NeurIPS 2021 paper "Online Adaption to Label Distribution Shift".☆15Updated 2 years ago
- Code for coreset selection methods☆245Updated 2 years ago
- An implementation of the penalty-based bilevel gradient descent (PBGD) algorithm and the iterative differentiation (ITD/RHG) methods.☆18Updated 2 years ago
- A curated list of papers of interesting empirical study and insight on deep learning. Continually updating...☆376Updated 2 weeks ago
- Code for CVPR paper: Computationally Budgeted Continual Learning: What Does Matter?☆18Updated last year
- A curated list of Model Merging methods.☆92Updated last year
- Survey Paper List - Efficient LLM and Foundation Models☆258Updated last year
- [TPAMI 2023] Low Dimensional Landscape Hypothesis is True: DNNs can be Trained in Tiny Subspaces☆42Updated 3 years ago
- This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".☆23Updated 5 months ago
- A Comprehensive Survey of Forgetting in Deep Learning Beyond Continual Learning. TPAMI, 2024.☆331Updated 3 weeks ago
- SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)☆35Updated last year