kallewoof / lora-merge

A script for merging an LLM and a LoRA

☆12

Alternatives and similar repositories for lora-merge

Users who are interested in lora-merge are comparing it to the repositories listed below.

zarakiquemparte / zaraki-tools
☆27 Updated last year
ElleLeonne / Lightning-ReLoRA
A public implementation of the ReLoRA pretraining method, built on Lightning-AI's Pytorch Lightning suite.
☆33 Updated last year
bentoml / BentoLMDeploy
Self-host LLMs with LMDeploy and BentoML
☆20 Updated 3 weeks ago
nyunAI / PruneGPT
☆53 Updated last year
kyegomez / Finetuning-Suite
Finetune any model on HF in less than 30 seconds
☆57 Updated 2 months ago
serp-ai / Parameter-Efficient-MoE
Parameter-Efficient Sparsity Crafting From Dense to Mixture-of-Experts for Instruction Tuning on General Tasks
☆31 Updated last year
The-Swarm-Corporation / AgentParse
AgentParse is a high-performance parsing library designed to map various structured data formats (such as Pydantic models, JSON, YAML, an…
☆13 Updated this week
the-crypt-keeper / the-muse
Experimental sampler to make LLMs more creative
☆31 Updated last year
EduardTalianu / EntropixLab
entropix style sampling + GUI
☆26 Updated 8 months ago
nexusflowai / NexusBench
Nexusflow function call, tool use, and agent benchmarks.
☆23 Updated 6 months ago
attashe / ModifiedBeamSampler
Modified Beam Search with periodical restart
☆12 Updated 9 months ago
Digitous / ModelREVOLVER
Model REVOLVER, a human in the loop model mixing system.
☆33 Updated last year
geronimi73 / 3090_shorts
minimal scripts for 24GB VRAM GPUs. training, inference, whatever
☆40 Updated 2 weeks ago
kaiokendev / cutoff-len-is-context-len
Demonstration that finetuning RoPE model on larger sequences than the pre-trained model adapts the model context limit
☆63 Updated 2 years ago
serp-ai / unsloth
5X faster 60% less memory QLoRA finetuning
☆21 Updated last year
Gryphe / MergeMonster
An unsupervised model merging algorithm for Transformers-based language models.
☆105 Updated last year
ChrisHayduk / qlora-multi-gpu
QLoRA with Enhanced Multi GPU Support
☆37 Updated last year
xaedes / llama.cpp
Port of Facebook's LLaMA model in C/C++
☆22 Updated last year
agokrani / distillKitPlus
Easy to use, High Performant Knowledge Distillation for LLMs
☆86 Updated 2 months ago
AnswerDotAI / ModernBERT-Instruct-mini-cookbook
☆48 Updated 4 months ago
IlyasMoutawwakil / py-txi
A Python wrapper around HuggingFace's TGI (text-generation-inference) and TEI (text-embedding-inference) servers.
☆33 Updated last month
Zyphra / Zyda_processing
☆35 Updated last year
camenduru / Replit-v1-CodeInstruct-3B-colab
☆18 Updated last year
matthewrenze / jhu-concise-cot
The Benefits of a Concise Chain of Thought on Problem Solving in Large Language Models
☆22 Updated 7 months ago
emrgnt-cmplxty / zero-shot-replication
☆73 Updated last year
LLM360 / k2-data-prep
☆20 Updated last year
silphendio / sliced_llama
Simple LLM inference server
☆20 Updated last year
austinsilveria / tricksy
Fast approximate inference on a single GPU with sparsity aware offloading
☆38 Updated last year
official-elinas / zeus-llm-trainer
Zeus LLM Trainer is a rewrite of Stanford Alpaca aiming to be the trainer for all Large Language Models
☆69 Updated last year
kyegomez / LM-Infinite
Implementation of "LM-Infinite: Simple On-the-Fly Length Generalization for Large Language Models"
☆41 Updated 7 months ago