NJUDeepEngine / meteora
This repository contains the implementation of the paper "MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models".
☆16 · Updated 4 months ago
Alternatives and similar repositories for meteora:
Users interested in meteora are comparing it to the repositories listed below.
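MeteoRA's building block is LoRA, which adds a trainable low-rank update to a frozen weight matrix. A minimal NumPy sketch of the generic LoRA formulation follows (an illustration only, not MeteoRA's multi-task embedding or routing; the function and variable names are hypothetical):

```python
import numpy as np

def lora_forward(x, W, A, B, alpha=16.0):
    """Frozen weight W plus a low-rank LoRA update (alpha/r) * B @ A.

    x: (d_in,) input; W: (d_out, d_in) frozen weight;
    A: (r, d_in) and B: (d_out, r) trainable low-rank factors.
    """
    r = A.shape[0]
    delta = (alpha / r) * (B @ A)  # rank-r update to W
    return (W + delta) @ x

rng = np.random.default_rng(0)
d_in, d_out, r = 8, 4, 2
W = rng.standard_normal((d_out, d_in))
A = rng.standard_normal((r, d_in))
B = np.zeros((d_out, r))  # B starts at zero, so the adapter is initially a no-op

x = rng.standard_normal(d_in)
# With B = 0 the adapted layer matches the frozen layer exactly.
assert np.allclose(lora_forward(x, W, A, B), W @ x)
```

Because the update has rank r, each task adapter stores only (d_in + d_out) * r extra parameters per layer, which is what makes embedding many adapters into one base model feasible.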
- Preprint: Asymmetry in Low-Rank Adapters of Foundation Models ☆35 · Updated last year
- AdaMerging: Adaptive Model Merging for Multi-Task Learning (ICLR 2024) ☆72 · Updated 5 months ago
- Less is More: Task-aware Layer-wise Distillation for Language Model Compression (ICML 2023) ☆34 · Updated last year
- [ICML 2024] Official code for the paper "Revisiting Zeroth-Order Optimization for Memory-Efficient LLM Fine-Tuning: A Benchmark" ☆95 · Updated 9 months ago
- ThinK: Thinner Key Cache by Query-Driven Pruning ☆18 · Updated last month
- Representation Surgery for Multi-Task Model Merging (ICML 2024) ☆42 · Updated 5 months ago
- [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight) ☆54 · Updated 5 months ago
- Quantized Side Tuning: Fast and Memory-Efficient Tuning of Quantized Large Language Models ☆42 · Updated 4 months ago
- [ICML 2024] Unveiling and Harnessing Hidden Attention Sinks: Enhancing Large Language Models without Training through Attention Calibrati… ☆33 · Updated 9 months ago
- [ICML 2024] Junk DNA Hypothesis: A Task-Centric Angle of LLM Pre-trained Weights through Sparsity; Lu Yin*, Ajay Jaiswal*, Shiwei Liu, So… ☆16 · Updated 10 months ago
- Official PyTorch implementation of "OwLore: Outlier-weighed Layerwise Sampled Low-Rank Projection for Memory-Efficient LLM Fine-tuning" b… ☆31 · Updated 10 months ago
- LISA: Layerwise Importance Sampling for Memory-Efficient Large Language Model Fine-Tuning ☆29 · Updated 11 months ago
- [ACL 2024] Not All Experts are Equal: Efficient Expert Pruning and Skipping for Mixture-of-Experts Large Language Models ☆86 · Updated 10 months ago
- Code accompanying the paper "Massive Activations in Large Language Models" ☆151 · Updated last year
- [NeurIPS 2024 Spotlight] EMR-Merging: Tuning-Free High-Performance Model Merging ☆53 · Updated last month
- Code for merging large language models ☆29 · Updated 7 months ago
- Squeezed Attention: Accelerating Long Prompt LLM Inference ☆45 · Updated 4 months ago
- Code for the paper "Merging Multi-Task Models via Weight-Ensembling Mixture of Experts" ☆22 · Updated 9 months ago
- Code release for VTW (AAAI 2025, Oral) ☆33 · Updated 2 months ago
- [ICML 2024] SPP: Sparsity-Preserved Parameter-Efficient Fine-Tuning for Large Language Models ☆18 · Updated 10 months ago
- [ICLR 2024 Spotlight] Code for the paper "Merge, Then Compress: Demystify Efficient SMoE with Hints from Its Routing Policy" ☆77 · Updated 9 months ago
- Fast and Robust Early-Exiting Framework for Autoregressive Language Models with Synchronized Parallel Decoding (EMNLP 2023, Long) ☆57 · Updated 6 months ago
- Accepted LLM papers at NeurIPS 2024 ☆34 · Updated 5 months ago