cjyaras / deep-lora-transformersLinks

Compressible Dynamics in Deep Overparameterized Low-Rank Learning & Adaptation (ICML'24 Oral)

☆13

Alternatives and similar repositories for deep-lora-transformers

Users that are interested in deep-lora-transformers are comparing it to the libraries listed below

Sorting:

yifanycc / loretta
[NAACL 24 Oral] LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
☆36Updated 6 months ago
andyjm3 / SLTrain
SLTrain: a sparse plus low-rank approach for parameter and memory efficient pretraining (NeurIPS 2024)
☆32Updated 9 months ago
alvin-zyl / CoLA
Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation
☆23Updated 5 months ago
pilancilab / Riemannian_Preconditioned_LoRA
source code for paper "Riemannian Preconditioned LoRA for Fine-Tuning Foundation Models"
☆30Updated last year
yifanycc / AdaZeta
[EMNLP 24] Source code for paper 'AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tu…
☆11Updated 7 months ago
yxli2123 / LoSparse
☆59Updated last year
zyushun / hessian-spectrum
Code for the paper: Why Transformers Need Adam: A Hessian Perspective
☆60Updated 4 months ago
VijayLingam95 / SVFT
☆30Updated 5 months ago
MaximeRobeyns / bayesian_lora
Bayesian Low-Rank Adaptation for Large Language Models
☆35Updated last year
mmatena / model_merging
☆71Updated 3 years ago
SempraETY / Pruning-via-Merging
☆20Updated 8 months ago
MohammadrezaBanaei / LoRA-XS
LoRA-XS: Low-Rank Adaptation with Extremely Small Number of Parameters
☆35Updated this week
abdelfattah-lab / TokenButler
☆23Updated last week
osehmathias / lisa
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
☆33Updated last year
MarlonBecker / MSAM
☆19Updated last year
VITA-Group / Junk_DNA_Hypothesis
[ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So…
☆16Updated 3 months ago
UKPLab / iclr2024-model-merging
This is the repository for "Model Merging by Uncertainty-Based Gradient Matching", ICLR 2024.
☆28Updated last year
adamxyang / laplace-lora
Bayesian low-rank adaptation for large language models
☆23Updated last year
Qualcomm-AI-research / llm-surgeon
☆29Updated last year
zqOuO / GWT
☆13Updated 6 months ago
Model-GLUE / Model-GLUE
☆15Updated 11 months ago
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆173Updated last year
IlanPrice / DCTpS
Code for testing DCT plus Sparse (DCTpS) networks
☆14Updated 4 years ago
ZO-Bench / ZO-LLM
[ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark ".
☆109Updated last month
hahnyuan / ASVD4LLM
Activation-aware Singular Value Decomposition for Compressing Large Language Models
☆74Updated 9 months ago
VITA-Group / Random-MoE-as-Dropout
[ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…
☆53Updated 2 years ago
ycjing / Awesome-Model-Merging
A curated list of Model Merging methods.
☆92Updated 10 months ago
JeanKaddour / NoTrainNoGain
Revisiting Efficient Training Algorithms For Transformer-based Language Models (NeurIPS 2023)
☆80Updated last year
aim-uofa / LoRAPrune
☆56Updated 7 months ago
pilancilab / matrix-compressor
Implementation of LPLR algorithm for matrix compression
☆29Updated last year