nikhil-ghosh-berkeley / loraplus

☆219

Alternatives and similar repositories for loraplus

Users who are interested in loraplus are comparing it to the libraries listed below.

catid / dora
Implementation of DoRA
☆296 · Updated last year
wuhy68 / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆144 · Updated 9 months ago
nbasyl / DoRA
Official implementation of "DoRA: Weight-Decomposed Low-Rank Adaptation"
☆124 · Updated last year
Cohere-Labs-Community / parameter-efficient-moe
☆263 · Updated last year
prateeky2806 / ties-merging
☆183 · Updated last year
yxli2123 / LoftQ
☆223 · Updated last year
GraphPKU / PiSSA
PiSSA: Principal Singular Values and Singular Vectors Adaptation of Large Language Models (NeurIPS 2024 Spotlight)
☆361 · Updated 2 weeks ago
microsoft / rho
Repo for Rho-1: Token-level Data Selection & Selective Pretraining of LLMs.
☆427 · Updated last year
CASE-Lab-UMD / LLM-Drop
The official implementation of the paper "What Matters in Transformers? Not All Attention is Needed".
☆174 · Updated 3 months ago
FasterDecoding / BitDelta
☆199 · Updated 7 months ago
HanGuo97 / lq-lora
☆127 · Updated last year
Guitaricet / relora
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
☆458 · Updated last year
locuslab / massive-activations
Code accompanying the paper "Massive Activations in Large Language Models"
☆169 · Updated last year
kongds / MoRA
MoRA: High-Rank Updating for Parameter-Efficient Fine-Tuning
☆357 · Updated 11 months ago
Digitous / LLM-SLERP-Merge
Spherical Merge Pytorch/HF format Language Models with minimal feature loss.
☆132 · Updated last year
astramind-ai / Mixture-of-depths
Unofficial implementation for the paper "Mixture-of-Depths: Dynamically allocating compute in transformer-based language models"
☆167 · Updated last year
lucidrains / CALM-pytorch
Implementation of CALM from the paper "LLM Augmented LLMs: Expanding Capabilities through Composition", out of Google Deepmind
☆177 · Updated 10 months ago
jongwooko / distillm
Official PyTorch implementation of DistiLLM: Towards Streamlined Distillation for Large Language Models (ICML 2024)
☆224 · Updated 4 months ago
jxiw / MambaInLlama
[NeurIPS 2024] Official Repository of The Mamba in the Llama: Distilling and Accelerating Hybrid Models
☆222 · Updated 2 months ago
dwzhu-pku / PoSE
Positional Skip-wise Training for Efficient Context Window Extension of LLMs to Extremely Length (ICLR 2024)
☆204 · Updated last year
EricLBuehler / xlora
X-LoRA: Mixture of LoRA Experts
☆231 · Updated 11 months ago
BorealisAI / flora-opt
This is the official repository for the paper "Flora: Low-Rank Adapters Are Secretly Gradient Compressors" in ICML 2024.
☆104 · Updated last year
llm-random / llm-random
☆191 · Updated last week
haonan3 / AnchorContext
AnchorAttention: Improved attention for LLMs long-context training
☆208 · Updated 6 months ago
yuhuixu1993 / qa-lora
Official PyTorch implementation of QA-LoRA
☆138 · Updated last year
NVlabs / Minitron
A family of compressed models obtained via pruning and knowledge distillation
☆344 · Updated 8 months ago
lucidrains / coconut-pytorch
Implementation of 🥥 Coconut, Chain of Continuous Thought, in Pytorch
☆177 · Updated 3 weeks ago
mlfoundations / scaling
Language models scale reliably with over-training and on downstream tasks
☆97 · Updated last year
qwopqwop200 / gptqlora
GPTQLoRA: Efficient Finetuning of Quantized LLMs with GPTQ
☆102 · Updated 2 years ago
SalesforceAIResearch / GemFilter
☆80 · Updated 6 months ago