TsinghuaC3I / SoRA

[EMNLP 2023, Main Conference] Sparse Low-rank Adaptation of Pre-trained Language Models

☆84

Alternatives and similar repositories for SoRA

Users who are interested in SoRA are comparing it to the repositories listed below.

harveyhuang18 / EMR_Merging
[NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging
☆72 Updated 8 months ago
ycjing / Awesome-Model-Merging
A curated list of Model Merging methods.
☆92 Updated last year
mrflogs / LoRA-Pro
Official code for our paper, "LoRA-Pro: Are Low-Rank Adapters Properly Optimized?"
☆135 Updated 7 months ago
Chongjie-Si / Subspace-Tuning
A generalized framework for subspace tuning methods in parameter efficient fine-tuning.
☆160 Updated 4 months ago
osehmathias / lisa
LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning
☆35 Updated last year
EnnengYang / AdaMerging
AdaMerging: Adaptive Model Merging for Multi-Task Learning. ICLR, 2024.
☆96 Updated last year
GCYZSL / MoLA
☆168 Updated last year
yule-BUAA / MergeLLM
Codes for Merging Large Language Models
☆33 Updated last year
wutaiqiang / MoSLoRA
☆123 Updated last year
Clin0212 / HydraLoRA
[NeurIPS'24 Oral] HydraLoRA: An Asymmetric LoRA Architecture for Efficient Fine-Tuning
☆229 Updated 11 months ago
r-three / smear
☆30 Updated 2 years ago
ShiZhengyan / DePT
[ICLR 2024] This is the repository for the paper titled "DePT: Decomposed Prompt Tuning for Parameter-Efficient Fine-tuning"
☆98 Updated last year
Pbihao / SLM
☆28 Updated last year
EnnengYang / RepresentationSurgery
Representation Surgery for Multi-Task Model Merging. ICML, 2024.
☆46 Updated last year
adymaharana / d2pruning
☆42 Updated 2 years ago
Chaos96 / fourierft
☆148 Updated last year
gyhdog99 / MoCLE
MoCLE (First MLLM with MoE for instruction customization and generalization!) (https://arxiv.org/abs/2312.12379)
☆44 Updated 4 months ago
sail-sg / Attention-Sink
[ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
☆135 Updated 4 months ago
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆186 Updated last year
VITA-Group / Random-MoE-as-Dropout
[ICLR 2023] "Sparse MoE as the New Dropout: Scaling Dense and Self-Slimmable Transformers" by Tianlong Chen*, Zhenyu Zhang*, Ajay Jaiswal…
☆55 Updated 2 years ago
GATECH-EIC / ACT
[ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati…
☆46 Updated last year
which47 / LLMCL
Analyzing and Reducing Catastrophic Forgetting in Parameter Efficient Tuning
☆36 Updated last year
aim-uofa / LoRAPrune
☆60 Updated 11 months ago
lliai / Awesome-Low-Rank-Adaptation
Awesome-Low-Rank-Adaptation
☆122 Updated last year
nik-dim / tall_masks
Official repository of "Localizing Task Information for Improved Model Merging and Compression" [ICML 2024]
☆51 Updated last year
waltonfuture / Diff-eRank
[NeurIPS 2024] A Novel Rank-Based Metric for Evaluating Large Language Models
☆54 Updated 5 months ago
iboing / CorDA
CorDA: Context-Oriented Decomposition Adaptation of Large Language Models for task-aware parameter-efficient fine-tuning (NeurIPS 2024)
☆53 Updated 10 months ago
cmnfriend / O-LoRA
☆190 Updated last year
DavidFanzz / SCMoE
☆27 Updated last year
TUDB-Labs / MixLoRA
State-of-the-art Parameter-Efficient MoE Fine-tuning Method
☆197 Updated last year